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ABSTRACT 

This paper focuses on cosmological constraints derived from analysis of WMAP data alone. 

A simple ACDM cosmological model fits the five-year WMAP temperature and polarization 
data. The basic parameters of the model are consistent with the three- year data and now better 
constrained: tt b h 2 = 0.02273 ± 0.00062, fl c h 2 = 0.1099 ± 0.0062, Ov = 0.742 ± 0.030, n s = 
0.963io.oi5> r = 0-0^7 ± 0.017, ag = 0.796 ± 0.036. With five years of polarization data, we 
have measured the optical depth to reionization, r > 0, at 5cr significance. The redshift of an 
instantaneous reionization is constrained to be z re ion = 11.0 ± 1.4 with 68% confidence. The 2a 
lower limit is z re ion >8.2, and the 3a limit is z re i on >6.7. This excludes a sudden reionization 
of the universe at z = 6 at more than 3.5<r significance, suggesting that reionization was an 
extended process. Using two different methods for polarized foreground cleaning, and foreground 
marginalization, we get consistent estimates for the optical depth. This cosmological model 
also fits small-scale CMB data, and a range of astronomical data measuring the expansion rate 
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and clustering of matter in the universe. We find evidence for the first time in the CMB power 
spectrum for a non-zero cosmic neutrino background, or a background of relativistic species, with 
the standard three light neutrino species preferred over the best-fit ACDM model with iV e ff = 0 
at > 99.5% confidence, and N e ^ > 2.3 (95% CL) when varied. The five-year WMAP data 
improve the upper limit on the tensor-to-scalar ratio, r < 0.43 (95% CL), for power-law models, 
and halve the limit on r for models with a running index, r < 0.58 (95% CL). With longer 
integration we find no evidence for a running spectral index, with dn s /d\nk — — 0.037 ± 0.028, 
and find improved limits on isocurvature fluctuations. The current WMAP-only limit on the sum 
of the neutrino masses is ^ tyi v < 1.3 eV (95% CL), which is robust, to within 10%, to a varying 
tensor amplitude, running spectral index or dark energy equation of state. 

Subject headings: cosmic microwave background, cosmology: observations, polarization, early 
universe 


1. Introduction 

The Wilkinson Microwave Anisotropy Probe (WMAP), launched in 2001, has mapped out the Cosmic 
Microwave Background with unprecedented accuracy over the whole sky. Its observations have led to the 
establishment of a simple concordance cosmological model for the contents and evolution of the universe, 
consistent with virtually all other astronomical measurements. The WMAP first-year and three-year data 
have allowed us to place strong constraints on the parameters describing the ACDM model, a flat universe 
filled with baryons, cold dark matter, neutrinos, and a cosmological constant, with initial fluctuations de- 
scribed by nearly scale-invariant power law fluctuations, as well as placing limits on extensions to this simple 
model (Spergel et al. 2003, 2007). With all-sky measurements of the polarization anisotropy (Kogut et al. 
2003; Page et al. 2007), two orders of magnitude smaller than the intensity fluctuations, WMAP has not only 
given us an additional picture of the universe as it transitioned from ionized to neutral at redshift z ~ 1100, 
but also an observation of the later reionization of the universe by the first stars. 

In this paper we present cosmological constraints from WMAP alone, for both the ACDM model and 
a set of possible extensions. We also consider the consistency of WMAP constraints with other recent 
astronomical observations. This is one of seven five-year WMAP papers. Hinshaw et al. (2008) describe the 
data processing and basic results, Hill et al. (2008) present new beam models and window functions, Gold 
et al. (2008) describe the emission from Galactic foregrounds, and Wright et al. (2008) the emission from 
extra-Galactic point sources. The angular power spectra are described in Nolta et al. (2008), and Komatsu 
et al. (2008) present and interpret cosmological constraints based on combining WMAP with other data. 

WMAP observations are used to produce full-sky maps of the CMB in five frequency bands centered at 
23, 33, 41, 61, and 94 GHz (Hinshaw et al. 2008). With five years of data, we are now able to place better 
limits on the ACDM model, as well as to move beyond it to test the composition of the universe, details 
of reionization, sub-dominant components, characteristics of inflation, and primordial fluctuations. We have 
more than doubled the amount of polarized data used for cosmological analysis, allowing a better measure 
of the large-scale E-mode signal (Nolta et al. 2008). To this end we describe an alternative way to remove 
Galactic foregrounds from low resolution polarization maps in which Galactic emission is marginalized over, 
providing a cross-check of our results. With longer integration we also better probe the second and third 
acoustic peaks in the temperature angular power spectrum, and have many more year-to-year difference 
maps available for cross-checking systematic effects (Hinshaw et al. 2008). 
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The paper is structured as follows. In §2 and §3 we focus on methodology, starting in §2 by describing a 
new method for cleaning the polarization maps using Gibbs sampling, and presenting estimated component 
maps for CMB, synchrotron, and dust emission. These are used to cross-check the standard template-cleaned 
results. The CMB likelihood and parameter estimation methodology is described in §3. We describe a fast 
method for computing the large-scale temperature likelihood, based on work described in Wandelt et al. 
(2004), which also uses Gibbs sampling, and outline more efficient techniques for sampling cosmological 
parameters. In §4 we present cosmological parameter results from five years of WMAP data for the ACDM 
model, and discuss their consistency with recent astronomical observations. Finally we consider constraints 
from WMAP alone on a set of extended cosmological models in §5, and conclude in §6. 


2. Bayesian estimation of low resolution polarization maps 

The three-year WMAP observations (Page et al. 2007) showed that polarized diffuse emission from our 
Galaxy dominates the primordial signal over much of the sky, particularly at the lower frequencies probed 
by WMAP. Accurate estimation of the CMB signal at large angular scales therefore poses a significant 
challenge. In this section, we present results from a new method for estimating the polarized CMB signal, 
which serves both as a cross-check of the template cleaning method used by WMAP, and as a framework in 
which errors due to foreground uncertainty are more rigorously propagated. We start by briefly reviewing 
our understanding of diffuse polarized Galactic components in the microwave regime. A more detailed review 
was made in Page et al. (2007). 

Both synchrotron and thermal dust emission are polarized to some degree due to the Galactic magnetic 
field, measured to have a coherent spiral structure parallel to the Galactic plane, as well as a significant 
turbulent component (Spitzer 1998; Beck 2001; Vallee 2005; Han 2006). The effective strength of the field is 
of order ~ 10 //G, and is thought to be split roughly equally between the coherent and turbulent components 
(Crutcher et al. 2003). Synchrotron emission is produced by relativistic cosmic-ray electrons accelerated in 
this magnetic field (see Strong et al. (2007) for a review of cosmic ray propogation) . For electrons with a 
power law distribution of energies 

N(E) oc FT p , (1) 

the frequency dependence of the emission is characterized by antenna temperature T{y) ex with spectral 
index (3 = — (p + 3)/2, with typically (3 ~ — 3 (Rybicki & Lightman 1979). However, since synchrotron loss is 
proportional to P 2 , this means that older sources of electrons should have a lower energy distribution and a 
steeper spectral index of synchrotron emission, compared to regions of recently injected electrons. This leads 
to a synchrotron index that varies over the sky (Lawson et al. 1987; Reich & Reich 1988) and is expected 
to steepen away from the Galactic plane (Strong et al. 2007), with evidence of this behavior seen in the 
WMAP data (Bennett et al. 2003). Since the cosmic-ray electrons emit radiation almost perpendicular to 
the Galactic magnetic field in which they orbit, they can produce polarization fractions as high as ~ 75% 
(Rybicki & Lightman 1979), although integration of multiple field directions along a line of sight reduces this 
level. The fractional polarization observed at radio frequencies in the range 408 MHz - 2.4 GHz is further 
lowered due to Faraday rotation (Duncan et al. 1995; Uyaniker et al. 1999; Wolleben et al. 2006), but the 
integrated signal observed by WMAP at 23 GHz reaches upper limits of ~ 50% (Kogut et al. 2007). 

Thermal emission of sub-micron sized dust grains, which absorb starlight radiation and re-emit thermally 
in the far infra-red with typical spectral index (3 ~ 2, is the dominant Galactic component at the highest 
frequencies probed by WMAP. Its intensity has been well measured by the IRAS and COBE missions and 
extrapolated to microwave frequencies by Finkbeiner et al. (1999). Polarization arises since grains tend 
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to align their long axes perpendicular to the Galactic magnetic field via, for example, the Davis-Greenstein 
mechanism (Davis & Greenstein 1951), and depending on their composition can be polarized up to a modeled 
maximum of ~ 20% parallel to their long axes (e.g., Hildebrand h Dragovan (1995)). Observations of 
starlight, polarized perpendicular to the dust grains, are consistent with this picture (Heiles 2000; Berdyugin 
et al. 2001), as are the three-year WMAP observations (Page et al. 2007; Kogut et al. 2007). A population 
of smaller dust grains formed of polycyclic aromatic hydrocarbons may also emit a significant amount of 
microwave radiation due to electric dipole rotational emission (Draine & Lazarian 1999; Draine &; Li 2007). 
This question is discussed in e.g. Hinshaw et al. (2007); Dobler & Finkbeiner (2007); Gold et al. (2008) 
with respect to the intensity signal observed by WMAP. However, these small spinning dust grains are not 
expected to be significantly polarized (Draine 2003). Other mechanisms for producing polarized emission, 
including magnetic dust (Draine & Lazarian 1999), have not been observed to be dominant. 

Given these two polarized foreground components, the standard method used to clean the WMAP 
polarization maps involves subtracting synchrotron and dust template maps from the total, leaving a cleaned 
CMB map at the Ka, Q, and V bands (Page et al. 2007; Gold et al. 2008). The templates are assumed to be 
exact, such that the noise properties of the cleaned CMB maps are the same as the uncleaned maps, although 
the error is increased to account for the small CMB signal present in the synchrotron template (Page et al. 
2007). This method has proved satisfactory for cosmological interpretation, but only the central two bands 
(Q and V) were used in the three-year WMAP analysis. To confidently use the low frequency WMAP bands 
we want an independent cross-check of the result, and ideally an improved method for marginalizing over 
foreground uncertainty. The method described here uses a Bayesian framework for estimating low resolution 
CMB, synchrotron, and dust maps. We use maps with HEALPix A r S id e = 8 16 , with pixels of side 7.2 
degrees. The main products are maps, and a covariance matrix, of the polarized CMB Q and U components, 
marginalized over foreground uncertainty. A similar technique has also been developed by Eriksen et al. 
(2006, 2007) for estimating intensity maps, and has been applied to the three-year WMAP temperature 
maps. We provide a description of our method and the main results in this paper, but further details of the 
method and application to simulated maps are given in a separate paper (Dunkley et al. 2008). 


2.1. Map estimation method 

We model the total polarized emission in antenna temperature at frequency v as 

m„ = /( v)A c + (v/v k )P°A. s + (u/u w )^ d A.d + n„, (2) 

where is a vector of length 2 N p containing the Stokes Q and U signal for the CMB, synchrotron and 
dust components. The amplitude vectors of each component are given by A x , where x = c, s, d for CMB, 
synchrotron and dust respectively, defined at vk — 22.8 GHz for CMB and synchrotron, and vw — 93.5 
GHz for dust. The Gaussian noise, n^, is quantified by the 2N P x 2N p noise matrix at each band, NT, which 
includes pixel-pixel covariance and Q-U covariance. The spectral indices of the synchrotron and dust are 
given by /3 X , and f(v) converts thermodynamic temperature to antenna temperature (see Hinshaw et al. 
(2008)). Given the observed data d^, the joint probability distribution of the parameters A x and (3 X can be 
written as p(A, /3\d) = p(d|A, /3)p(A, /3), with prior distribution p(A, (3) and Gaussian likelihood 

—2 lnp(d|A, (3) = ^(d„ - m„) T N i ; 1 (d„ - m„) + c, (3) 


16 The number of pixels is 12iV2. de , where iV S ide = 2 3 for r3, or resolution 3 (Gorski et al. 2005). 
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with a normalization that appears as an additive term c in the log- likelihood. The objective is to sample 
from this distribution, to obtain the marginalized distribution for the CMB amplitudes, 

p(A c |d) = J p(A,f3\d)dA s dA d d/3, (4) 

as well as estimates for the Galactic components. The method is applied only at low resolution, for N p = 768. 
We form the data vectors from the four lowest frequency WMAP bands (K, Ka, Q, V), giving 8 N p data 
points. We start with the co-added maps and N _1 matrices for each band at HEALPix iV sic je = 16. We then 
degrade the maps and inverse noise matrices to N S [d e = 8, using the full noise matrices. 

In the general case the spectral indices could vary pixel-by-pixel, and could take different values for 
Q and U. A more general modeling would also allow for non power-law indices. However, for the dust 
component we fix f3 d = 1.7 for all pixels, motivated by Finkbeiner et al. (1999). We show in Appendix 
A that changing this to (3 d — 2 has no effect on our results. We allow the polarized synchrotron index 
to vary spatially, but fix the Q and U index to be the same for a given pixel. We also define N t < N p 
regions of the sky within which f3 s takes a common value, rather than allow it to take a unique value at each 
pixel, and impose a Gaussian prior of f3 s = —3.0 ± 0.15 on the index in each region. We do not allow for a 
steepening of the index in our fiducial model. These priors are motivated both by our noise levels and by our 
understanding of the emission process: even though we expect spatial variation due to the different ages of 
the electron populations, the electron diffusion rate limits how much the index can vary over a short range 
(Strong et al. 2000, 2007). We choose Ni = 30 in the fiducial model, with the choice of regions discussed 
further in Appendix A. The prior on the index range is rather tight, as indices of -2.7 and -3.3 are physically 
reasonable, and a more conservative prior would be preferred. However, we have found this to be necessary 
for numerical stability, and check that modifying the central value to (3 S = —2.8 has little effect on our 
estimated CMB signal. We make no prior assumptions about the CMB or synchrotron Q and U amplitudes 
at each pixel, but impose a prior that the dust be < 20% polarized, using the dust map 1^ at 94 GHz from 
Finkbeiner et al. (1999), hereafter FDS, as a tracer of the intensity. We additionally impose a Gaussian prior 
on the dust Stokes parameters of [Qd(n), Ud(n)\ = 0 ± 0.2 /^(n). This is sufficiently broad that moving the 
central value of the prior to e.g. 5% of the dust map (rather than 0%), has a negligible effect on results. 

The distribution is therefore described by 6N P amplitudes and Ni synchrotron spectral indices. We 
cannot sample the joint distribution p(A,/3|d) directly, so we use Markov Chain Monte Carlo methods to 
draw samples from it. It can be sliced into two conditional distributions p(A\/3,d) and p(/3|A, d), so we 
can use Gibbs sampling to draw alternately from each conditional distribution, constructing a Markov chain 
with the desired joint distribution as its stationary distribution. The sampling would go as follows for the 
case of one A parameter and one /3 parameter: we start from some arbitrary point (A*, /3{) in the parameter 
space. Then we draw 

(^t+l, A+l), (Ai+2, A+2)--- (5) 

by first drawing Ai+i from p(A|$, d) and then drawing /%+ 1 from p(/3\A i+ i,d). Then we iterate many times. 
The result is a Markov chain whose stationary distribution is p(A,/3\d). A description of Gibbs sampling 
can be found in Gelfand & Smith (1990), and in the cosmological literature in Wandelt et al. (2004); Eriksen 
et al. (2007). 

For the multivariate case A and (3 are now vectors, and so each vector is drawn in turn. For fixed /3, the 
conditional distribution p(A\/3, d) is a 6iV p -dimensional Gaussian, so it is possible to directly draw a sample 
of all 6N p amplitude parameters simultaneously. The mean and variance of such a sample for a given /3 
vector are derived in Dunkley et al. (2008). It is a simple step, but computationally intensive. For fixed A, 
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we cannot invert the distribution p(/3 |A,d) to draw a sample. Instead we use the Metropolis algorithm to 
sample /3, which has only Ni elements. The Metropolis algorithm has been described in detail in e.g., Neal 
(1993); Lewis & Bridle (2002); Dunkley et al. (2005). 

This combination of sampling steps is commonly known as Metropolis-within-Gibbs (e.g. Geweke & 
Tanizaki (2001)), and has been used in astronomy to estimate Cepheid distances (Barnes et al. 2003). The full 
sampling, drawing a new A and then a new f3 in turn until convergence, produces marginalized distributions 
for the CMB, synchrotron, and dust amplitudes, and synchrotron indices. We form maps of each component 
from the mean of the marginalized distribution, 


(Ax) = 


/ 


p( A X |d) A X (LA. X 


— TK, 


( 6 ) 


where the sum is over all uq elements in the chain, and A l x is the ith chain element. The covariance 
matrices for A x , including off-diagonal terms, are estimated using the same method, summing over the chain 
components. In this analysis we work with low resolution maps with N p = 768, since the computational 
costs of working at higher resolution are large and our main scientific goal is to determine the polarized 
CMB signal at large angular scale. Tests of this method on simulated maps, constructed with three- year 
WMAP noise properties, are described in Dunkley et al. (2008). The CMB and Galactic foreground maps, 
and their associated cosmological parameters, are recovered with the expected y 2 between input and output 
components. 


2.2. Results: Low resolution polarization maps 

Figures 1 and 2 show maps of the mean values and la errors for the marginalized CMB, synchrotron, 
and dust amplitudes. We first consider the CMB results. The noise patterns for both Q and U in Figure 2 are 
consistent with what we expect: in regions of low Galactic emission, the errors are dominated by instrumental 
noise. As the Galactic plane is approached, errors from foreground uncertainty begin to dominate, in 
particular where the dust contribution is most uncertain. As opposed to template cleaning, which produces 
CMB maps at each frequency observed, this method recovers a single Q and U polarization map, and so 
has higher signal-to- noise than any of the individual template-cleaned maps. There is some indication of 
structure in the CMB signal, particularly in the Q Stokes map at the Galactic anti-center, and in the U 
map in the region of the North Polar Spur. While this could indicate some Galactic contamination, it is 
consistent with noise. Outside the P06 mask the maps are morphologically similar to the template-cleaned 
CMB maps co-added over Ka, Q, and V bands. 

To quantify the polarization anisotropy present in the Q and U maps, we use the coordinate-independent 
scalar and pseudo-scalar E and B modes commonly used in cosmological analysis (Seljak 1997; Kamionkowski 
et al. 1997; Page et al. 2007). This decomposition provides a useful way of separating the primordial 
perturbation modes, since scalar, or density, perturbations give rise only to T modes and the curl-free E 
modes, while tensor perturbations produce T, E and divergence- free B modes. Both polarization modes 
probe the evolution of the decoupling and reionization epochs and are generated by Thomson scattering of 
a quadrupolar radiation pattern by free electrons. The anisotropy is quantified using the Cj E , C EE , Cf B 
power spectra, where 


CF = (olA:.)- 


(7) 
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Fig. 1. — Low resolution polarized Q (top) and U (bottom) maps of the CMB, synchrotron, and dust 
emission, estimated from the five-year K, Ka, Q, and V band maps using Gibbs sampling. Pixels inside the 
processing mask are grey. The CMB maps (left panels, thermodynamic temperature) do not show significant 
Galactic contamination in the plane. The synchrotron amplitudes (center, antenna temperature), are defined 
at K-band (22.8 GHz), and are consistent with the total K-band maps, with high Q and U emission from 
the North Polar Spur, and high Q emission in the Galactic plane at longitude 110 l ;$ 170. The dust 
amplitudes (right, antenna temperature) are defined at W-band (93.5 GHz), and have a prior that bounds 
P = y/Q 2 + U* to be less than 20% of the dust intensity in each pixel. 

The spin-2 decomposition of the polarization maps, is related to the Q and U maps by 

e 

[Q±iU](x) = EE =F2^m =F2F^ m (x) (8) 

£> 1 m=—i 

where ± 2 dim = af m ± iafm (Zaldarriaga k, Seljak 1997). 

The optical depth to reionization is the parameter that has most effect on the low-^ E-mode signal. After 
decoupling there are no free electrons to scatter the CMB until the first stars reionize the universe at redshift 
z rj scattering the intrinsic CMB quadrupole to produce a polarized signal. Its size is oc C 2 (^ r ) 1 / 2 r(z r ), where 
r is the optical depth, r(z) — car fj n e (z')dz' (dt / dz') , with <jt the Thompson cross section, c the speed of 
light and n e the free electron density. Reionization therefore has a distinctive signature at £ < 10 in the EE 
power spectrum, proportional to r 2 . 

The measure of r is one way to compare the Gibbs-cleaned maps produced here, with the template- 
cleaned maps from the main analysis (Gold et al. 2008). An agreement between the two methods does not 
guarantee that the inferred signal on the sky is the same using both methods, but is a necessary consistency 
check. We compute r using the exact likelihood technique described in Appendix D of Page et al. (2007), with 
the amplitude of the first temperature acoustic peak fixed, and all other cosmological parameters fixed at 
best-fit ACDM values. The full marginalized covariance matrix is used, including pixel-pixel correlations. We 
find t = 0.100 ±0.017 for the Gibbs-cleaned maps, which is remarkably consistent with the results obtained 
through template cleaning, which give r = 0.088 ±0.016 for the KaQV data combination. Obtained using a 
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Fig. 2. — Estimated la errors on the low resolution maps of the CMB (left), synchrotron (center), and 
dust (right) Q and U components, as shown in Figure 1. The CMB errors are more fully described by 
a covariance matrix, including pixel-pixel correlation and Q/U correlation, so the maps can be used for 
cosmological analysis. The errors on the dust maps (right) are dominated by the prior that limits the dust 
polarization fraction to 20%. The middle panels clearly show the two sources of uncertainty in our CMB 
polarization maps: detector noise in the ecliptic plane (which traces a sideways S in the map) and foreground 
removal uncertainties in the galactic plane. 

different methodology and more fully accounting for foreground marginalization, this adds to our confidence 
in the detection of the CMB E-mode polarization signal, and to the errors assigned to the optical depth. 

We then perform a set of tests where we test the dependence of the estimated optical depth, r, on 
the prior assumptions made in the Gibbs-cleaning method, reported in Appendix A. We find that changing 
the dust spectral index to (3d = 2 has no effect on the results, nor does changing the location of the 30 
regions in the synchrotron spectral index mask, or reducing it to 15 regions. Changing the central value of 
the prior to —2.8 gives a consistent r = 0.103 ± 0.018. Changing the upper limit on the dust polarization 
fraction to 15% has no effect, but loosening it to 50% and 100% increasingly broadens the distribution and 
lowers the central value, but the results are consistent. When we include W-band in the analysis we find 
r = 0.093 =b 0.016. However, for cosmological purposes we prefer not to use W band, as the residual power 
from template-cleaning indicates a potential systematic error (Hinshaw et al. 2008). 

We also consider the goodness of fit of the fiducial model to the five-year data. Using this Bayesian 
method it is not easy to compare the best-fit values to those from template-cleaned maps. The sampling 
provides a marginalized estimate of the CMB map, with errors, rather than locating a single best-fit point. 
The maximum-likelihood model has \ 2 = 1.03 per data point, summed over the four bands, u (d u — 
m l/ ) T N“ 1 (d„ — m *,), for the full-sky. We would not expect this model to have \ 2 = 1 per degree of freedom, 
as thousands of parameters are being drawn simultaneously with the Gibbs algorithm, so the exact \ 2 
peak does not get drawn. We compare the fit to simulations, which give a maximum-likelihood model with 
X 2 — 1.01 per data point, for simulated data whose properties match the model. The real data therefore 
provides a comparably good fit. 
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Fig. 3. — Left: The estimated synchrotron polarization amplitude P = \fQ 2 + U 2 in antenna temperature 
at K-band, consistent with the total K-band signal in Hinshaw et al. (2008). Right two panels: The estimated 
synchrotron spectral index (center), and la errors (right), estimated in 30 regions of the sky. A Gaussian 
prior of f3 s = — 3.0 =b 0.15 is imposed in each region. In regions of low signal-to-noise (near the ecliptic poles), 
the prior drives the spectral index estimate, but the North Polar Spur and Galactic anti-center prefer a 
steeper index of ~ —3.1, given the prior. 


In the main parameter analysis described later in this paper we use the template-cleaned polarization 
maps in the main pipeline, as this method has been subjected to more rigorous cross-checks and null tests. 
However, we check the consistency of ACDM parameters using both methods. 


2.2.1. Synchrotron and dust polarization 

The synchrotron maps shown in Figure 1 are similar to the total K-band maps (Hinshaw et al. 2008), 
with the total polarized amplitude shown in Figure 3. The difference between the estimated synchrotron 
amplitude, and the K-band amplitude, is < 5 pK outside the P06 mask, and < 8 pK in the Galactic plane. 
This is not surprising, as the synchrotron emission is expected to dominate at 23 GHz. As observed in the 
three-year WMAP data (Page et al. 2007), the signal is dominated by emission from the North Polar Spur, 
marked on the microwave sky map in Hinshaw et al. (2007), as well as what is often known as the ‘Fan region’ 
(e.g., Wolleben et al. (2006)), centered on Galactic coordinates l ~ 140, b ~ 5. The synchrotron emission 
dominates the signal at low frequencies, and so the uncertainty in the synchrotron Q and U maps, shown 
in Figure 2, is dominated by instrument noise, with only a small additional contribution from component 
marginalization in the Galactic plane. Figure 3 shows the mean synchrotron spectral index estimated in the 
30 regions of the sky, together with la errors. Consistent maps are found for alternative choices of index 
regions. The prior —3.0 =b 0.15 prevents us from drawing strong conclusions, but there is a preference in the 
North Polar Spur and Fan region for a steeper index of ~ —3.1. In regions of low synchrotron signal-to-noise 
the index is driven by the prior. There is no clear trend of the index steepening off the plane, in contrast to 
what is observed in the synchrotron intensity (Bennett et al. 2003). This is consistent with a recent analysis 
by Miville-Deschenes et al. (2008), and is the signature of a low polarization component emitting at low 
frequencies. This component could be an additional non-synchrotron component in the inferred intensity 
map. It could also be a component whose signal is depolarized, such that the observed polarized signal along 
a line of sight comes from a physically separate region. Higher signal-to-noise measurements, and lower 
frequency observations, will help resolve the issue. 

The dust polarization map has a low signal-to-noise ratio, particularly far from the plane, as we only 
fit data in the K-V bands, excluding W band. In these regions the prior dominates the estimate of the 
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dust amplitude, making it hard to draw conclusions about the dust component. The error in both Q and 
U is driven by the prior on the polarization ampitudes and so is morphologically identical to the FDS dust 
intensity map. This explains why the error far from the plane is so low even though the dust can only be 
poorly measured there. The fractional polarization outside P06 is typically only 1-2%, where we use the 
degraded FDS dust map to trace intensity, and rises to ~10% in some regions of the plane. This is lower 
than the ~ 4% estimated in (Kogut et al. 2007; Gold et al. 2008). However, these maps are only estimated 
for v < 61 GHz, and in regions of low dust the prior will tend to prefer zero polarization. The fractional 
polarization estimate also assumes that the FDS map accurately traces the dust intensity in the WMAP 
frequency range. Inclusion of higher frequency data will tell us more about the polarized dust, and the 
analysis including W-band will be presented in a future release. 


3. Likelihood and parameter estimation methodology 
3.1. Likelihood 

The WMAP likelihood function takes the same format as for the three-year release, and software im- 
plementation is available on LAMBDA (http://lambda.gsfc.nasa.gov) as a standalone package. It takes in 
theoretical CMB temperature (TT), E-mode polarization (EE), B-mode polarization (BB), and temperature- 
polarization cross-correlation (TE) power spectra for a given cosmological model. It returns the sum of 
various likelihood components: \ow-£ temperature, \ow-£ TE/EE/BB polarization, high-^ temperature, high- 
£ TE cross-correlation, and additional terms due to uncertainty in the WMAP beam determination, and 
possible error in the extra-galactic point source removal. There is also now an additional option to compute 
the TB and EB likelihood. The main improvement in the five-year analysis is the implementation of a faster 
Gibbs sampling method for computing the £ < 32 TT likelihood, which we describe in Section 3.1.1. 

We continue to evaluate the exact likelihood for the polarization maps at low multipole, £ < 23, as 
described in Appendix D of Page et al. (2007). The input maps and inverse covariance matrix used in the 
main analysis are produced by co-adding the template-cleaned maps described in Gold et al. (2008). For 
testing, these are substituted by the Gibbs-cleaned map and inverse covariance matrix described in Section 
2 of this paper. In both cases these are weighted to account for the P06 mask using the method described 
in Page et al. (2007). In the three-year analysis we conservatively used only the Q and V bands in the 
likelihood. We are now confident that Ka band is cleaned sufficiently for inclusion in analyses (see Hinshaw 
et al. (2008) for a discussion). 

For £ > 32, the TT likelihood uses the combined pseudo-Q spectrum and covariance matrix described 
in Hinshaw et al. (2007), estimated using V and W bands. We do not use the EE or BB power spectra at 
£ > 23, but continue to use the TE likelihood described in Page et al. (2007), estimated using Q and V bands. 
The errors due to beam and point sources are treated the same as in the three-year analysis, described in 
Appendix A of Hinshaw et al. (2007). A discussion of this treatment can be found in Nolta et al. (2008). 


3.1.1. Low-£ temperature likelihood 

For a given set of cosmological parameters with theoretical power spectrum Ci, the likelihood function 
returns p(d\Ce), the likelihood of the observed map d, or its transformed aim coefficients. Originally, the 
likelihood code was written as a hybrid combination of a normal and lognormal distribution (Verde et al. 
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2003). This algorithm did not properly model the tails of the likelihood at low multipoles (Efstathiou 2004; 
Slosar et al. 2004; O’Dwyer et al. 2004; Hinshaw et al. 2007), and so for the three-year data the £ < 30 
likelihood was computed exactly, using 

r \ exp[(— (l/2)d T C -1 d] 

mc,) = ’ <9) 

where C is the covariance matrix of the data including both the signal covariance matrix and noise C (Ci) = 
S(C^) 4- N (Hinshaw et al. 2007). This approach is computationally intensive however, since it requires the 
inversion of a large covariance matrix each time the likelihood is called. 

In Wandelt et al. (2004) a faster method was developed to compute p(d\Ci), which we now adopt. It is 
described in detail in that paper, so we only briefly outline the method here. The method uses Gibbs sampling 
to first sample from the joint posterior distribution p(Ce, s|d), where Ct is the power spectrum and s is the 
true sky signal. From these samples, a Blackwell-Rao (BR) estimator provides a continuous approximation 
to p(Ce\d). When a flat prior, p(Ce) = const , is used in the sampling, we have p(Ce |d) oc p(d\Ce), where 
the constant of proportionality is independent of Ce. The BR estimator can then be used as an accurate 
representation of the likelihood, p(d\Ce) (Wandelt et al. 2004). 

The first step requires drawing samples from p(C^,s|d). We cannot draw samples from the joint dis- 
tribution directly, but we can from the two conditional distributions p(s\Ce,d) and p(C^|s,d), each a slice 
through the (N p x ^ max )-dimensional space. Samples are drawn alternately, forming a Markov Chain of 
points by the Gibbs algorithm, as introduced in Section 2: 

s i+1 - p^icj.d) 

Ci +l - P(C? |s i+ \d) 

This samples the full distribution p(Ce, s|d). 

The first conditional distribution is a multivariate Gaussian with mean S l (S* + N) _1 d and variance 
[(S*) -1 H- N —1 ] — 1 , so at each step a new signal vector s 1-1 " 1 , of size N p , can be drawn. This is computationally 
demanding however, as described in Wandelt et al. (2004), requiring the solution of a linear system of 
equations Mv = w, with M = 1 4- S 1 / 2 N _1 S 1 / 2 . These are solved at each step using the conjugate gradient 
technique, which is sped up by finding an approximate inverse for M, a preconditioner. This requires 
computation of the inverse noise matrix, N _1 in spherical harmonic space, which is done by computing the 
components of N _1 term by term using spherical harmonics in pixel space. There are more efficient ways 
to compute N _1 (Hivon et al. 2002; Eriksen et al. 2004), but computing the preconditioner is a one-time 
expense, and it is only necessary to compute harmonics up to £ — 30. 

The second conditional distribution, p(Ce |s, d) is an inverse Gamma distribution, from which a new Ce 
vector of size £ max can be rapidly drawn using the method in Wandelt et al. (2004). Sampling from these 
two conditional distributions is continued in turn until convergence, at which point the sample accurately 
represents the underlying distribution. This is checked in practice using a jacknife test that compares 
likelihoods from two different samples. 

Finally, once the joint distribution p(Q, s|d) has been pre-computed, the likelihood for any given model 
Ce is obtained by marginalizing over the signal s, p(d\Cg) oc f p(C^,s|d)ds, which holds for a uniform prior 
distribution p(Ce). In practice this is computed using the Blackwell-Rao estimator, 

p(d\C e ) oc f p(C e \s)p(s\d)ds « — ^p(C f |s') 
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Fig. 4. — This figure compares the low-£ TT power spectrum computed with two different techniques. At 
each £ value, we plot the maximum likelihood value (tic mark), the region where the likelihood is greater 
than 50% of the peak value (thick line) and the region where the likelihood is greater than 95% of the peak 
value (thin line). The black lines (left side of each pair) are estimated by Gibbs sampling using the ILC 
map smoothed with a 5 degree Gaussian beam (at HEALPIX iV side = 32). The light blue line (right side of 
the pair) is estimated with a pixel-based likelihood code with iV side = 16. The slight differences between the 
points are primarily due to differences in resolution. At each multipole, the likelihood is sampled by fixing 
the other Ce values at a fiducial spectrum (red). 


where the sum is over all uq samples in the Gibbs chain. Since p(Ce\s l ) = p(Ce\a l e ), where at = (2£ + 
l) -1 \ s £rn\ 2 i an d sem are the spherical harmonic coefficients of s, one only needs to store cr} at each 
step in the Gibbs sampling. Then, each time the likelihood is called for a new Ce, one computes C = 
P(Ce\ a i)/ n G> This requires only 0(£ max ^G) computations, compared to the full O(Np) evaluation of 
equation 9. This speed-up also means that the exact likelihood can be used to higher resolution than is 
feasible with the full evaluation, providing a more accurate estimation. 


Code details : Choice of £ limits, smoothing, and resolution 

The code used for WMAP is adapted from the MAGIC Gibbs code described in Wandelt (2003); Wandelt 
et al. (2004). The input temperature map is the five-year ILC map described in Gold et al. (2008). Because 
the Blackwell- Rao estimator works poorly in low signal-to-noise regions, we did not attempt to use it for the 
polarization data. To produce correct results, the Gibbs sampler requires an accurate model of the data. 
This means that the signal covariance matrix S (Ct) cannot be approximated to be zero except for multipoles 
where the smoothing make the signal much less than the noise. For the full WMAP data set, this would 
require sampling out to £ ~ 1000, with A^ s ide = 512. This is computationally expensive, taking more than of 
order 10 4 processor-hours to converge (O’Dwyer et al. 2004). Instead we reduce the resolution and smooth 
the data to substantially reduce the required multipole range, speeding up the computation. 
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The ILC map is smoothed to 5.0 degree FWHM, and sampled at AT side = 32. The process of smoothing 
the data has the side effect of correlating the noise. Correlated noise slows down the Gibbs sampling, as it 
takes longer to draw a sample from p(s|C*,d). We therefore add uncorrelated white noise to the map such 
that it dominates over the smoothed noise. However, the added noise must not be so large that it changes 
the likelihood of the low-£ modes; cosmic variance must remain dominant over the noise (Eriksen et al. 2007), 
so we add 2 fiK of noise per pixel. In Appendix B the noise power spectra are shown. 

The Gibbs sampler converges much more slowly in regions of low signal to noise. Because of this, we 
only sample spectra out to £ = 51 and fix the spectrum for 51 < £ < 96 to a fiducial value, and set it to zero 
for £ > 96. The BR estimator is only used up to £ — 32 for cosmological analysis, so we marginalize over the 
32 < £ < 51 spectrum. The likelihood is therefore p(L\d) = f p(L, H\d)dH, where L and H refer to the low, 
£ < 32, and higher, 32 < £ < 51, parts of the power spectrum. Examination of the BR estimator show it to 
have a smooth distribution. Tests of the results to various input choices, including choice of resolution, are 
given in Appendix B. We note that by using the \ow-£ likelihood only up to £ < 32, this breaks the likelihood 
into a low and high £ part, which introduces a small error by ignoring the correlation between these two 
parts of the spectrum. However, this error is small, and it is unfeasible, in a realistic sampling time, to use 
the BR estimator over the entire £ range probed by WMAP. 

In Figure 4 we show slices through the Ct distribution obtained from the BR estimator, compared to 
the pixel likelihood code. The estimated spectra agree well. Some small discrepancies are due to the pixel 
code using iV side = 16, compared to the higher resolution A^ide = 32 used for the Gibbs code. 


3.2. Parameter Estimation 


We use the Markov Chain Monte Carlo methodology described in Spergel et al. (2003, 2007) to explore 
the probability distributions for various cosmological models, using the five-year WMAP data and other 
cosmological data sets. We map out the full distribution for each cosmological model, for a given data 
set or data combination, and quote parameter results using the means and 68% confidence limits of the 
marginalized distributions, with 



d N xL(d\x)p(x)xi 


3 = 1 


(13) 


where xj is the jth value of the zth parameter in the chain. We also give 95% upper or lower limits when 
the distribution is one-tailed. We have made a number of changes in the five-year analysis, outlined here 
and in Appendix C. 


We parameterize our basic ACDM cosmological model in terms of the following parameters: 

(w&, w c , fi A , t , n s , A^j}, {Asz} (14) 

defined in Table 1. is the amplitude of curvature perturbations, and n s the spectral tilt, both defined 
at pivot scale ko = 0.002/Mpc. In this simplest model we assume instantaneous reionization of the universe, 
with optical depth r. The contents of the Universe, assuming a flat geometry, are quantified by the baryon 
density , the cold dark matter (CDM) density uj c and a cosmological constant, We treat the con- 
tribution to the power spectrum by Sunyaev-Zeldovich fluctuations as in Spergel et al. (2007), adding the 
predicted template spectrum from Komatsu & Seljak (2002), multiplied by an amplitude Asz , to the total 
spectrum. We limit 0 < Asz < 2 and impose unbounded uniform priors on the remaining six parameters. 
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Parameter Description 

c Jb Baryon density, Q.^h 2 

uj c Cold dark matter density, Q. c h 2 

Dark energy density, with w = — 1 unless stated 
Amplitude of curvature perturbations at ko = 0.002/Mpc 
n s Scalar spectral index at ko = 0.002/Mpc 

r Reionization optical depth 

Asz SZ marginalization factor 

dn s /d\nk Running in scalar spectral index 

r Ratio of the amplitude of tensor fluctuations to scalar fluctuations 

Oi-i Fraction of anti-correlated CDM isocurvature (See Sec 5.1.3) 

a o Fraction of uncorrelated CDM isocurvature (See Sec 5.1.3) 

N e fi Effective number of relativistic species (assumed neutrinos) 

ujv Massive neutrino density, Q^h 2 

Spatial curvature, 1 — f2 to t 

w Dark energy equation of state, w = Pde/ PDE 

Yp Primordial Helium fraction 

x e Ionization fraction of first step in two-step reionization 


z r Reionization redshift of first step in two-step reionization 

<78 Linear theory amplitude of matter fluctuations on 8 h -1 Mpc scales 

Ho Hubble expansion factor (lOOh Mpc -1 km s -1 ) 

Y^rriu Total neutrino mass (eV) ^rriu = 94 Q u h 2 
n m Matter energy density + f2 c + 

Clmh 2 Mattfer energy density 

to Age of the universe (billions of years) 

Zreion Redshift of instantaneous reionization 
7710 Ratio of baryon to photon number densities, 10 10 (ri5/n 7 ) = 273.9 Q^h 2 


Table 1: Cosmological parameters used in the analysis, http://lambda.gsfc.nasa.gov lists the marginalized values for 
these parameters for all of the models discussed in this paper. 


We also consider extensions to this model, parameterized by 

{dn s /d In /c, r, a_i, a 0 , ^Ik, w, uj u , 7V eff , Yp, x e , z r } (15) 

also defined in Table 1. These include cosmologies in which the primordial perturbations have a running 
scalar spectral index dn s /d\nk , a tensor contribution with tensor- to- sc alar ratio r, or an anti-correlated 
or uncorrelated isocurvature component, quantified by ct_i, cto- They also include models with a curved 
geometry SI*,, a constant dark energy equation of state w, and those with massive neutrinos = 

94 £t u h 2 eV, varying numbers of relativistic species iV e ff, and varying primordial Helium fraction Yp. There 
are also models with non- instantaneous ‘two-step’ reionization as in Spergel et al. (2007), with an initial 
ionized step at z r with ionized fraction x e , followed by a second step at 2 = 7 to a fully ionized universe. 

These parameters all take uniform priors, and are all sampled directly, but we bound 7V e ff < 10, w > 
—2.5, z r < 30 and impose positivity priors on r, a_i, ao, Y p , and $1 a, as well as requiring 0 < x e < 1. 
The tensor spectral index is fixed at nt = — r/8. We place a prior on the Hubble constant of 20 < Ho < 100, 
but this only affects non-flat models. Other parameters, including erg, the redshift of reionization, z r eion, and 
the age of the universe, to, are derived from these primary parameters and described in Table 1. A more 
extensive set of derived parameters are provided on the LAMBDA website. 
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Fig. 5. — The temperature angular power spectrum corresponding to the WMAP-only best-fit ACDM model. 
The grey dots are the unbinned data; the black data points are binned data with lcr error bars including 
both noise and cosmic variance computed for the best-fit model. 


We continue to use the CAMB code (Lewis et al. 2000) to generate the CMB power spectra for a 
given set of cosmological parameters. Given the improvement in the WMAP data, we have determined that 
distortions to the spectra due to weak gravitational lensing should now be included. We use the lensing 
option in CAMB which roughly doubles the time taken to generate a model, compared to the unlensed case. 

We have made a number of changes in the parameter sampling methodology. Our main pipeline now 
uses an MCMC code originally developed for use in Bucher et al. (2004), which has been adapted for WMAP. 
For increased speed and reliability, it incorporates two changes in the methodology described in Spergel et al. 
(2007). It uses a modified sampling method that generates a single chain for each model (instead of the four, 
or eight, commonly used in cosmological analyses). We also use an alternative spectral convergence test 
that can be run on a single chain, developed in Dunkley et al. (2005), instead of the Gelman h Rubin test 
used in Spergel et al. (2007). These are both described in Appendix C. We also use the publicly available 
CosmoMC sampling code (Lewis & Bridle 2002) as a secondary pipeline, used as an independent cross-check 
for a limited set of models. 


4. The ACDM Cosmological Model 
4.1. WMAP five-year parameters 

The ACDM model, described by just six parameters, is still an excellent fit to the WMAP data. The 
temperature and polarization angular power spectra are shown in Nolta et al. (2008). With more observation 
the errors on the third acoustic peak in the temperature angular power spectrum have been reduced. The 
TE cross-correlation spectrum has also improved, with a better measurement of the second anti-correlation 
at £ ~ 500. The low-£ signal in the EE spectrum, due to reionization of the universe, is now measured with 
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Parameter 

3 Year Mean 

5 Year Mean 

5 Year Max Like 

ioojm 2 

2.229 ± 0.073 

2.273 ± 0.062 

2.27 

n c h 2 

0.1054 ±0.0078 

0.1099 ±0.0062 

0.108 

Da 

0.759 ± 0.034 

0.742 ± 0.030 

0.751 

n s 

0.958 ±0.016 

0.963^0015 

0.961 

T 

0.089 ± 0.030 

0.087 ±0.017 

0.089 

<1 

(2.35 ±0.13) x 10“ 9 

(2.41 ±0.11) x 10“ 9 

2.41 x 10 -9 

08 

0.761 ±0.049 

0.796 ± 0.036 

0.787 

D rn 

0.241 ± 0.034 

0.258 ± 0.030 

0.249 

D m h 

0.128 ±0.008 

0.1326 ±0.0063 

0.131 

Hi o 

73.21^ 

71.9^;? 

72.4 

^reion 

11.0 ±2.6 

11.0 ± 1.4 

11.2 

to 

13.73 ±0.16 

13.69 ±0.13 

13.7 


Table 2: ACDM model parameters and 68% confidence intervals from the five-year WMAP data alone. The three- 
year values are shown for comparison. For best estimates of parameters, the marginalized ‘Mean’ values should be 
used. The ‘Max Like’ values correspond to the single model giving the highest likelihood. 


higher significance (Nolta et al. 2008). The best-fit 6 parameter model, shown in Figure 5, is successful in 
fitting three TT acoustic peaks, three TE cross-correlation maxima/minima, and the \ow-£ EE signal. The 
model is compared to the polarization data in Nolta et al. (2008). The consistency of both the temperature 
and polarization signals with ACDM continues to validate the model. 

The five-year marginalized distributions for ACDM, shown in Table 2 and Figures 6 and 7, are consistent 
with the three-year results (Spergel et al. 2007), but the uncertainties are all reduced, significantly so for 
certain parameters. With longer integration of the large-scale polarization anisotropy, there has been a 
significant improvement in the measurement of the optical depth to reionization. There is now a 5cr detection 
of r, with mean value r = 0.087±0.017. This can be compared to the three-year measure of r = 0.089 ±0.03. 
The central value is little altered with two more years of integration, and the inclusion of the Ka band data, 
but the limits have almost halved. This measurement, and its implications, are discussed in Sec 4.1.1. 

The higher acoustic peaks in the TT and TE power spectra also provide more information about the 
ACDM model. Longer integration has resulted in a better measure of the height and position of the third 
peak. The highest multipoles have a slightly higher mean value relative to the first peak, compared to the 
three-year data. This can be attributed partly to improved beam modeling, and partly to longer integration 
time reducing the noise. The third peak position constrains D^ 275 /i (Page et al. 2003), while the third peak 
height strongly constrains the matter density, f } m h 2 . In this region of the spectrum, the WMAP data are 
noise-dominated so that the errors on the angular power spectrum shrink as 1/t. The uncertainty on the 
matter density has dropped from 12% in the first year data to 8% in the three year data and now 6% in the 
five year data. The CDM density constraints are compared to three-year limits in Figure 6. The spectral 
index still has a mean value 2.5cr less than unity, with n s = 0.963^015 • This continues to indicate the 
preference of a red spectrum consistent with the simplest inflationary scenarios (Linde 2005; Boyle et al. 
2006), and our confidence will be enhanced with more integration time. 

Both the large scale EE spectrum and the small scale TT spectrum contribute to an improved measure 
of the amplitude of matter fluctuations. With the CMB we measure the amplitude of curvature fluctuations, 
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Fig. 6. — Constraints on ACDM parameters from the five-year WMAP data. The two-dimensional 68% 
and 95% marginalized limits are shown in blue. They are consistent with the three-year constraints (grey). 
Tighter limits on the amplitude of matter fluctuations, erg, and the cold dark matter density f£ c /i 2 , arise from 
a better measurement of the third temperature (TT) acoustic peak. The improved measurement of the EE 
spectrum provides a 5cr detection of the optical depth to reionization, r, which is now almost uncorrelated 
with the spectral index n s . 




quantified by A^, but we also derive limits on <r 8 , the amplitude of matter fluctuations on 8/i _1 Mpc scales. 
A higher value for r produces more overall damping of the CMB temperature signal, making it somewhat 
degenerate with the amplitude, A^, and therefore erg. The value of erg also affects the height of the acoustic 
peaks at small scales, so information is gained from both temperature and polarization. The five-year data 
give <7g = 0.796 ± 0.036, slightly higher than the three-year result, driven by the increase in the amplitude 
of the power spectrum near the third peak. The value is now remarkably consistent with new measurements 
from weak lensing surveys, as discussed in Section 4.2. 


4-1.1. Reionization 

Our observations of the acoustic peaks in the TT and TE spectrum imply that most of the ions and 
electrons in the universe combined to make neutral hydrogen and helium at 2 : ~ 1100. Observations of 
quasar spectra show no Gunn-Peterson trough at z > 5.8 (Fan et al. 2000, 2001) implying that the universe 
was nearly fully ionized by z = 5.7. How did the universe make the transition from being nearly fully 
neutral to fully ionized? The astrophysics of reionization has been a very active area of research in the past 
decade. Several recent reviews (Barkana & Loeb 2006; Fan et al. 2006; Furlanetto et al. 2006; Meiksin 2007) 
summarize the current observations and theoretical models. Here, we highlight a few of the important issues 
and discuss some of the implications of the WMAP measurements of optical depth. 
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Fig. 7. — Constraints from the five-year WMAP data on ACDM parameters (blue), showing marginalized 
one-dimensional distributions and two-dimensional 68% and 95% limits. Parameters are consistent with the 
three-year limits (grey) from Spergel et al. (2007), and are now better constrained. 
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Fig. 8. — Left: Marginalized probability distribution for z r eion in the standard model with instantaneous 
reionization. Sudden reionization at 2 = 6 is ruled out at 3.5cr, suggesting that reionization was a gradual 
process. Right: In a model with two steps of reionization (with ionization fraction x e at redshift 2 r , followed 
by full ionization at 2 = 7), the WMAP data are consistent with an extended reionization process. 


What objects reionized the universe? While high redshift galaxies are usually considered the most likely 
source of reionization, AGNs may also have played an important role. As galaxy surveys push towards ever 
higher redshift, it is unclear whether the known population of star forming galaxies at 2 ~ 6 could have 
ionized the universe (see e.g., Bunker et al. (2007)). The EE signal clearly seen in the WMAP five-year data 
(Nolta et al. (2008); §2) implies an optical depth, r ~ 0.09. This large optical depth suggests that higher 
redshift galaxies, perhaps the low luminosity sources appearing in z > 7 surveys (Stark et al. 2007), played 
an important role in reionization. While the known population of AGNs can not be a significant source 
of reionization (Bolton & Haehnelt 2007; Srbinovsky & Wyithe 2007), an early generation of supermassive 
black holes could have played a role in reionization (Ricotti & Ostriker 2004; Ricotti et al. 2007). This early 
reionization would also have an impact on the CMB. 

Most of our observational constraints probe the end of the epoch of reionization. Observations of 2 > 6 
quasars (Becker 2001; Djorgovski et al. 2001; Fan et al. 2006; Willott et al. 2007) find that the Lyman-a 
optical depth rises rapidly. Measurements of the afterglow spectrum of a gamma ray burst at z = 6.3 (Totani 
et al. 2006) suggest that universe was mostly ionized at 2 — 6.3. Lyman alpha emitter surveys (Taniguchi 
et al. 2005; Malhotra & Rhoads 2006; Kashikawa et al. 2006; lye et al. 2006; Ota et al. 2007) imply a 
significant ionized fraction at 2 = 6.5. The interpretation that there is a sudden change in the properties of 
the IGM remains a subject of active debate (Becker et al. 2007; Wyithe et al. 2008). 

The WMAP data place new constraints on the reionization history of the universe. The WMAP data best 
constrains the optical depth due to reionization at moderate redshift (z < 25) and only indirectly constrains 
the redshift of reionization. If reionization is sudden, then the WMAP data implies that 2 re ion = 11.0 ± 1.4, 
shown in Figure 8, and now excludes 2 re ion — 6 at more than 99.9% CL. The combination of the WMAP data 
implying that the universe was mostly reionized at 2 ~ 11 and the measurements of rapidly rising optical 
depth at 2 ~ 6 — 6.5 suggest that reionization was an extended process rather than a sudden transition. Many 
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Fig. 9. — Effect of foreground treatment and likelihood details on ACDM parameters. Left: The number 
of bands used in the template cleaning (denoted ‘T’), affects the precision to which r is determined, with 
the standard KaQV compared to QV and KaQVW, but has little effect on other cosmological parameters. 
Using maps cleaned by Gibbs sampling (KKaQV (G)) also gives consistent results. Right: Lowering the 
residual point source contribution (‘lower ptsrc’) and removing the marginalization over an SZ contribution 
(‘No SZ’) affects parameters by < 0.4cr. Using a larger mask (‘80% mask’) has a greater effect, increasing 
Vt^h 2 by 0.5cr, but is consistent with the effects of noise. 


early studies of reionization envisioned a rapid transition from a neutral to a fully ionized universe occurring 
as ionized bubbles percolate and overlap. As Figure 8 shows, the WMAP data suggests a more gradual 
process with reionization beginning perhaps as early as z ~ 20 and strongly favoring z > 6. This suggests 
that the universe underwent an extended period of partial reionization . The limits were found by modifying 
the ionization history in CAMB to include two steps in the ionization fraction at late times ( z < 30): the 
first at z r with ionization fraction x e , the second at z = 7 with x e = 1. Several studies (Cen 2003; Chiu et al. 
2003; Wyithe &; Loeb 2003; Haiman &; Holder 2003; Yoshida et al. 2004; Choudhury &; Ferrara 2006; Iliev 
et al. 2007; Wyithe et al. 2008) suggest that feedback produces a prolonged or perhaps even, multi-epoch 
reionization history. 

While the current WMAP data constrain the optical depth of the universe, the EE data does not yet 
provide a detailed constraint on the reionization history. With more data from WMAP and upcoming data 
from Planck, the EE spectrum will begin to place stronger constraints on the details of reionization (Kapling- 
hat et al. 2003; Holder et al. 2003; Mortonson &; Hu 2008). These measurements will be supplemented by 
measurements of the Ostriker-Vishniac effect by high resolution CMB experiments which is sensitive to 
f n 2 dt (Jaffe & Kamionkowski 1998). 
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4.1.2. Sensitivity to foreground cleaning 

As the E-mode signal is probed with higher accuracy, it becomes increasingly important to test how 
much the constraint on r, z re [ on , and the other cosmological parameters, depend on details of the Galactic 
foreground removal. Tests were done in Page et al. (2007) to show that r was insensitive to a set of variations 
in the dust template used to clean the maps. In Figure 9 we show the effect on ACDM parameters of changing 
the number of bands used in the template-cleaning method: discarding Ka band in the C QV’ combination, 
or adding W band in the ‘KaQVW’ combination. We find that r (and therefore z re ion) is sensitive to the 
maps, but the dispersion is consistent with noise. As expected, the error bars are broadened for the QV 
combined data, and the mean value is r = 0.080 =b 0.020. When W band is included, the mean value is 
r = 0.100 ± 0.015. We choose not to use the W band map in our main analysis however, as noted in Section 
2.1, because there appears to be excess power in the cleaned map at t — 7. This indicates a potential 
systematic error, and is discussed further in Hinshaw et al. (2008). The other cosmological parameters are 
only mildly sensitive to the number of bands used. This highlights the fact that r is no longer as strongly 
correlated with other parameters, as in earlier WMAP data (Spergel et al. 2003, 2007), notably with the 
spectral index of primordial fluctuations, n s (Figure 6). 

We also test the parameters obtained using the Gibbs-cleaned maps described in Section 2, which use the 
K, Ka, Q, and V band maps. Their distributions are also shown in Figure 9, and have mean r = O.lOOiLO.018. 
This is less than lcr higher than the KaQV template-cleaned maps but uses a independent method. The 
other cosmological parameters are changed by less than 0.3cr compared to the template-cleaned results. This 
consistency gives us confidence that the parameter constraints are little affected by foreground uncertainty. 


4-1.3. Sensitivity to likelihood details 

The likelihood code used for cosmological analysis has a number of variable components that have 
been fixed using our best estimates. Here we consider the effect of these choices on the five-year ACDM 
parameters. The first two are the treatment of the residual point sources, and the treatment of the beam 
error, both discussed in Nolta et al. (2008). The multi-frequency data are used to estimate a residual point 
source amplitude of A ps = 0.011 ± 0.001 ^xK 2 sr, which scales the expected contribution to the cross-power 
spectra of sources below our detection threshold. It is defined in Hinshaw et al. (2007); Nolta et al. (2008), 
and is marginalized over in the likelihood code. The estimate comes from QVW data, whereas the VW data 
give 0.007 ± 0.003 /iK 2 sr, both using the KQ85 mask described in Hinshaw et al. (2008). The right panels in 
Figure 9 shows the effect on a subset of parameters of lowering A ps to the VW value, which leads to a slightly 
higher n Sl fl c h 2 and as, all within OAa of the fiducial values, and consistent with more of the observed high-^ 
signal being due to CMB rather than unresolved point sources. We also use A ps = 0.011 /xK 2 sr with no 
point source error, and find a negligible effect on parameters (< O.lcr). The beam window function error is 
quantified by ten modes, and in the standard treatment we marginalize over them, following the prescription 
in Hinshaw et al. (2007). We find that removing the beam error also has a negligible effect on parameters. 
This is discussed further in Nolta et al. (2008), who considers alternative treatments of the beam and point 
source errors. 

The next issue is the treatment of a possible contribution from Sunyaev Zeldovich fluctuations. We 
account for the SZ effect in the same way as in the three-year analysis, marginalizing over the amplitude 
of the contribution parameterized by the Komatsu-Seljak model (Komatsu h Seljak 2002). The parameter 
Asz is unconstrained by the WMAP data, but is not strongly degenerate with any other parameters. In 
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Figure 9 we show the effect on parameters of setting the SZ contribution to zero. Similar to the effect of 
changing the point source contribution, the parameters depending on the third peak are slightly affected, 
with a < 0.25cr increase in n s , Q c h 2 , cr 8 and similar decrease in baryon density. 

Another choice is the area of sky used for cosmological interpretation, or how much we mask out to 
account for Galactic contamination. Gold et al. (2008) discuss the new masks used for the five-year analysis, 
with the KQ85 mask used as standard. We test the effect of using the more conservative KQ80 mask, and 
find a more noticeable shift. The quantity Pl^h 2 is increased by 0.5a, and n s , fl c h 2 and a 8 all decreased 
by ~ 0.4(j. This raised concerns that the KQ85 mask contains residual foreground contamination, but as 
discussed in Nolta et al. (2008), this shift is found to be consistent with the effects of noise, tested with 
simulations. We also confirm that the effect on parameters is even less for ACDM models using WMAP with 
external data, and that the choice of mask has only a small effect on the tensor amplitude, raising the 95% 
confidence level by ~5%. 

Finally, we test the effect on parameters of varying aspects of the low-^ TT treatment. These are 
discussed in Appendix B, and in summary we find the same parameter results for the pixel-based likelihood 
code compared to the Gibbs code, when both use £ < 32. Changing the mask at low-^ to KQ80, or using 
the Gibbs code up to £ < 51, instead of £ < 32, has a negligible effect on parameters. 


4.2. Consistency of the ACDM model with other data sets 

While the WMAP data alone place strong constraints on cosmological parameters, there has been a 
wealth of results from other cosmological observations in the last few years. These observations can gen- 
erally be used either to show consistency of the simple ACDM model parameters, or to constrain more 
complicated models. In this section we describe various current astronomical data sets that provide cosmo- 
logical information. A subset of the data is used to place combined constraints on extended cosmological 
models in Komatsu et al. (2008). Here we compare a much broader set of data to the WMAP ACDM model. 


4.2.1. Small-scale CMB measurements 

A number of recent CMB experiments have probed smaller angular scales than WMAP can reach and 
are therefore more sensitive to the higher order acoustic oscillations and the details of recombination. Since 
the three-year WMAP analysis, there have been new temperature results from the Arcminute Cosmology 
Bolometer Array Receiver (ACBAR), both in 2007 (Kuo et al. 2007) and in 2008 (Reichardt et al. 2008). 
They have measured the angular power spectrum at 145 GHz to 5’ resolution, over ~600 deg 2 . Their results 
are consistent with the model predicted by the WMAP five-year data, shown in Figure 10, although ACBAR 
is calibrated using WMAP, so the data are not completely independent. 

Figure 10 also shows data from the BOOMERANG, CBI and VSA experiments, which agree well with 
WMAP. There have also been new observations of the CMB polarization from two ground-based experiments, 
QUaD, operating at 100 GHz and 150 GHz (P. Ade et al. 2007), and CAPMAP, at 40 GHz and 100 GHz 
(Bischoff et al. 2008). Their measurements of the EE power spectrum are shown in Nolta et al. (2008), 
together with detections already made since 2005 (Leitch et al. 2005; Sievers et al. 2007; Barkats et al. 2005; 
Montroy et al. 2006), and are all consistent with the ACDM model parameters. 

In our combined analysis in Komatsu et al. (2008) we use two different data combinations. For the first 
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Multipole moment l 

Fig. 10. — The best-fit temperature angular power spectrum from WMAP alone (red), is consistent with 
data from recent small-scale CMB experiments: ACBAR, CBI, VS A and BOOMERANG. 


we combine four data sets. This includes the 2007 ACBAR data (Kuo et al. 2007), using 10 bandpowers in 
the range 900 <^<2000. The values and errors were obtained from the ACBAR web site. We also include the 
three external CMB data sets used in Spergel et al. (2003): the Cosmic Background Imager (CBI; Mason 
et al. (2003); Sievers et al. (2003); Pearson et al. (2003); Readhead et al. (2004)), the Very Small Array (VS A; 
Dickinson et al. (2004)) and BOOMERANG (Ruhl et al. 2003; Montroy et al. 2006; Piacentini et al. 2006). 
As in the three- year release we only use bandpowers that do not overlap with the signal-dominated WMAP 
data, due to non-trivial cross-correlations, so we use seven bandpowers for CBI (in the range 948 < £ < 1739), 
five for VSA (894 < £ < 1407) and seven for BOOMERANG (924 < £ < 1370), using the lognormal form 
of the likelihood. Constraints are also found by combining WMAP with the 2008 ACBAR data, using 16 
bandpowers in the range 900 < £ < 2000. In this case the other CMB experiments are not included. We do 
not use additional polarization results for parameter constraints as they do not yet improve limits beyond 
WMAP alone. 


4.2.2. Baryon Acoustic Oscillations 

The acoustic peak in the galaxy correlation function is a prediction of the adiabatic cosmological model. 
It was first detected using the SDSS luminous red galaxy (LRG) survey, using the brightest class of galaxies 
at mean redshift 2 : = 0.35 by Eisenstein et al. (2005). The peak was detected at 100/i -1 Mpc separation, 
providing a standard ruler to measure the ratio of distances to z = 0.35 and the CMB at z — 1089, 
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Fig. 11. — Baryon Acoustic Oscillations expected for the best-fit ACDM model (red lines), compared to 
BAO in galaxy power spectra calculated from (left) combined SDSS and 2dFGRS main galaxies, and (right) 
SDSS LRG galaxies, by Percival et al. (2007a). The observed and model power spectra have been divided 
by P(k) smoo t h, a smooth cubic spline fit described in Percival et al. (2007a). 


and the absolute distance to z = 0.35. More recently Percival et al. (2007b) have obtained a stronger 
detection from over half a million SDSS main galaxies and LRGs in the DR5 sample. They detect baryon 
acoustic oscillations (BAO) with over 99% confidence. A combined analysis was then undertaken of SDSS 
and 2dFGRS by Percival et al. (2007a). They find evidence for BAO in three catalogs: at mean redshift 
z = 0.2 in the SDSS DR5 main galaxies plus the 2dFGRS galaxies, at z = 0.35 in the SDSS LRGs, and in 
the combined catalog. Their data are shown in Figure 11, together with the WMAP best-fit model. The 
BAO are shown by dividing the observed and model power spectra by P(k) smoot h , a smooth cubic spline fit 
described in Percival et al. (2007a). The observed power spectra are model-dependent, but were calculated 
using = 0.25 and h = 0.72, which agrees with our maximum-likelihood model. 

The scale of the BAO is analyzed to estimate the geometrical distance measure at z = 0.2 and z = 0.35, 

Dv(z) = [(1 + z) 2 D 2 a cz/H(z)} 1/3 , ' (16) 

where Da is the angular diameter distance and H(z) the Hubble parameter. They find r s /Dy (0.2) = 0.1980=1= 
0.0058 and r s /Dy( 0.35) = 0.1094 ± 0.0033. Here r s is the comoving sound horizon scale at recombination. 
Our ACDM model, using the WMAP data alone, gives r s /Dy{ 0.2) = 0.1946 =L 0.0079 and r s /Dy( 0.35) = 
0.1165 =1= 0.0042, showing the consistency between the CMB measurement at z — 1089 and the late-time 
galaxy clustering. However, while the z — 0.2 measures agree to within lcr, the z — 0.35 measurements have 
mean values almost 2 a apart. The BAO constraints are tighter then the WMAP predictions, which shows 
that they can improve upon the WMAP parameter determinations, in particular on and 

We use the combined bounds from both surveys to constrain models as described in Percival et al. 
(2007a), adding a likelihood term given by — 2 InL = X T C~ l X , with 

X T = [r s /D v ( 0.2) - 0.1980, r s /D v (0.35) - 0.1094] (17) 

and C 11 = 35059, C 12 — —24031, C 22 = 108 3 00, including the correlation between the two measurements. 
We also consider constraints using the SDSS LRG limits derived by Eisenstein et al. (2005), using the 
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combination 

A(z) = D v (z)y/shJtf/cz (18) 

for z = 0.35 and computing a Gaussian likelihood —2 In L = (A — 0.469(n s /0.98)~ o,35 ) 2 /0.0 1 7 2 . See Komatsu 
et al. (2008) for further discussion of the BAO data. 


4-2.3. Galaxy power spectra 

We can compare the predicted fluctuations from the CMB to the shape of galaxy power spectra, in 
addition to the scale of acoustic oscillations. The SDSS galaxy power spectum from DR3 (Tegmark et al. 
2004) and the 2dFGRS spectrum (Cole et al. 2005) were shown to be in good agreement with the WMAP 
three-year data, and used to place tighter constraints on cosmological models (Spergel et al. 2007), but there 
was some tension between the preferred values of the matter density (Q m = 0.236 ±0.020 with 2dFGRS and 
0.265 ±0.030 with SDSS). 

More recently two studies used photometric redshifts to estimate the galaxy power spectrum from the 
photometric-redshift catalogue of LRGs from the SDSS fourth data release (DR4, Padmanabhan et al. (2007); 
Blake et al. (2007)). Padmanabhan et al. (2007) compute the 3D real-space clustering power spectrum of 
the SDSS LRGs, probing galaxies in redshift range 0.2 < z < 0.6. Their estimates of D m = 0.30 ± 0.03, and 
D&/D m = 0.18 ± 0.04, for a fixed Hubble constant of h — 70, are consistent with our data, and with the 
Blake et al. (2007) analysis which finds O m /i = 0.195 ± 0.023 and Ob/D m = 0.16 ± 0.036, for h = 0.75. 

More precise measurements of the LRG power spectrum were then obtained from redshift measurements: 
Tegmark et al. (2006) used LRGs from SDSS DR4 in the range O.Olh/Mpc < k < 0.2h/Mpc combined 
with the three-year WMAP data to place strong constraints on cosmological models. However, there is a 
disagreement between the matter density predicted using different minimum scales, if the non-linear modeling 
used in Tegmark et al. (2006) is adopted. Using the three-year WMAP data combined with the LRG spectrum 
we find f2 m = 0.228 ±0.019, using scales with k < 0.1 h Mpc -1 , and = 0.248 ±0.018 for k < 0.2 h Mpc -1 . 
These constraints are obtained for the 6 parameter ACDM model, following the non-linear prescription in 
Tegmark et al. (2006). This agrees with results obtained from the DR5 main galaxy and LRG sample 
(Percival et al. 2007c), who argue that this shows evidence for scale-dependent bias on large-scales, which 
could explain the observed differences in the early SDSS and 2dFGRS results. While the WMAP five year 
ACDM model, with fi m = 0.258 ± 0.030, is not inconsistent with the measured spectra, we choose not to 
use the galaxy power spectra to place joint constraints on the majority of models in Komatsu et al. (2008). 
For ACDM we test joint constraints using the SDSS DR3 and 2dFGRS data separately, using the method 
described in Spergel et al. (2007), and using the SDSS DR4 LRG spectrum following the method in Tegmark 
et al. (2006). 


4 . 2 . 4 . Type la Supernovae 

In the last decade Type la supernovae have become an important cosmological probe, and have provided 
the first direct evidence for the acceleration of the universe by measuring the luminosity distance as a function 
of redshift. The observed dimness of high redshift supernovae (z ~ 0.5) was first measured by Riess et al. 
(1998); Schmidt et al. (1998); Perlmutter et al. (1999), confirmed with more recent measurements including 
Nobili et al. (2005); Krisciunas et al. (2005); Clocchiatti et al. (2006); Astier et al. (2006), and extended to 
higher redshift by Riess et al. (2004) who found evidence for the earlier deceleration of the universe. The 
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Fig. 12. — The red line shows the luminosity distance relationship predicted for the best-fit WMAP-only 
model (right column in Table 2). The points show supernova observations from HST (Riess et al. 2007), 
ESSENCE (Miknaitis et al. 2007), and SNLS (Astier et al. 2006). The plot shows the deviation of the 
luminosity distances from the empty universe model. The binned data combination is from Wright (2007). 


sample of high redshift supernovae has grown by over 80 since the three-year WMAP analysis. Recent HST 
measurements of 21 new high redshift supernova by Riess et al. (2007) include 13 at z > 1, allowing the 
measurement of the Hubble expansion H(z ) at distinct epochs and strengthening the evidence for a period 
of deceleration followed by acceleration. The ESSENCE Supernova Survey has also recently reported results 
from 102 supernovae discovered from 2002 to 2005 using the 4-m Blanco Telescope at the Cerro Tololo Inter- 
American Observatory (Miknaitis et al. 2007), of which 60 are used for cosmological analysis (Wood-Vasey 
et al. 2007). A combined cosmological analysis was performed of the complete supernova data set by Davis 
et al. (2007). 

We confirm in Figure 12 that the recently observed supernovae are consistent with the ACDM model, 
which predicts the luminosity distance nth as a function of redshift and is compared to the observed data 
from three independent Type la supernovae samples. In our combined analysis we use 135 supernova in the 
‘Gold’ sample, consisting of the original Gold sample described in Riess et al. (2004), updated with revised 
distance measurements and supplemented by those observed using HST in Riess et al. (2007), together with 
60 from the ESSENCE survey, and 115 supernova from the Supernova Legacy Survey (SNLS, Astier et al. 
(2006)), as described in Spergel et al. (2007). For each supernova the luminosity distance predicted from 
theory is compared to the observed value. This is derived from measurements of the apparent magnitude 
m and the inferred absolute magnitude M, to estimate a luminosity distance ii 0 bs — 5 log[dL(z)/Mpc] + 25. 
The likelihood is given by 

-2 In L = 'Y'^HobsAzi) ~ M o )} 2 /°obs,i ( 19 ) 

i 

summed over all supernovae, where the absolute magnitude is marginalized over (Lewis & Bridle 2002) and 
(Jobs is the observational error accounting for extinction, intrinsic redshift dispersion, K-correction and light 
curve stretch factors. 
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Data 

Parameter 

Lensing limits 

5-year WMAP limits 

CFHTLS Wide 
100 Sq Deg 
COSMOS 2D 
COSMOS 3D 

< r 8 (O m /0.25)°- 64 
a 8 ( fWO.24) 0 - 59 
a 8 (O m /0.3)°- 48 
<7 8 (D m /0.3)°- 44 

0.785 ± 0.043 
0.84 ± 0.07 
0.81 ±0.17 
0.866io!o68 

0.814 ±0.090 
0.832 ± 0.088 
0.741 ± 0.069 
0.745 ±0.067 


Table 3: Measurements of combinations of the matter density, Q m , and amplitude of matter fluctuations, a g, from 
weak lensing observations (Fu et al. 2008; Benjamin et al. 2007; Massey et al. 2007), compared to WMAP. 


4-2.5. Hubble constant measurements 

The WMAP estimated value of the Hubble constant, Ho = 71$^®s~ l Mpc -1 , assuming a flat geometry, 
is consistent with the HST measurement of Ho = 72 ± 8 km s -1 Mpc -1 (Freedman et al. 2001), where the 
error includes random and systematic uncertainties. It also agrees within la with a set of measurements 
from gravitationally lensed systems (Koopmans et al. 2003), measurements from SZ and X-ray observations 
(Bonamente et al. 2006), and measurements of the Cepheid distances to nearby galaxies that give Ho = 
73 ±6 km s -1 Mpc -1 (Riess et al. 2005). More recent measurements include the calibration of the Cepheid 
distance scale using the distance to the Maser-host galaxy NGC4258 (Macri et al. 2006), with an inferred 
measure 74 ± 7 km s -1 Mpc' 1 that agrees with the WMAP value, and a new measure of the Tully-Fisher 
zero-point (Masters et al. 2006), that also implies 74 ± 7 km s -1 Mpc -1 . However, a compilation of Cepheid 
distance measurements for ten galaxies using HST by Sandage et al. (2006) prefer a lower Hubble constant 
of 62 ± 6 km s -1 Mpc -1 , almost 2 a lower than the WMAP value. A low measure is also favored by 
new measurements of an eclipsing binary in M33 (Bonanos et al. 2006), which would imply a reduction to 
Ho = 61 km s -1 Mpc -1 of the measurement by Freedman et al. (2001). The opposite effect is found using 
revised parallaxes for Cepheids (van Leeuwen et al. 2007), which would raise the Sandage et al. (2006) value 
to 70 ±5 km s -1 Mpc -1 and the Freedman et al. (2001) value to 76±8 km s -1 Mpc -1 . Despite this range of 
preferred values, none of these measurements are inconsistent with our WMAP result for ACDM. We only 
include Hubble constant results for a small set of our combined constraints, using the Freedman et al. (2001) 
value of 72 ± 8 km s -1 Mpc -1 , with a Gaussian prior. 


4.2.6. Weak Lensing 

Weak gravitational lensing is produced by the distortion of galaxy images by the mass distribution along 
the line of sight. There have been significant advances in the measurement of this effect in recent years, and 
in the understanding of systematic effects in lensing analyses (e.g., Massey et al. (2007)). There have been 
improvements in measurements of redshift distributions, and better understanding of intrinsic alignment 
effects (Hirata et al. 2007), making it a valuable cosmological probe complementary to the CMB. Many 
early results by lensing surveys favored higher amplitudes of mass fluctuations than preferred by WMAP: 
a 8 = 0.94to.i4(^m/0-25) -0 - 52 from the RCS survey (Hoekstra et al. 2002), 0.91±0.8(f2 m /0.25) - °- 49 from the 
VIRMOS-DESCART survey (Van Waerbeke et al. 2005), and 0.88 ± 0.06 (for D m = 0.3) from the Canada- 
France-Hawaii Telescope Legacy Survey (CFHTLS) (Hoekstra et al. 2006). However, new measurements of 
the two-point correlation functions from the third year CFHTLS Wide survey (Fu et al. 2008), covering 57 
deg 2 , favor a lower amplitude consistent with the WMAP measurements: <7 8 (f2 m /0.25)°- 64 = 0.785 ± 0.043, 
which agrees with the inferred value from WMAP. A comparison of the lensing results with the equivalent 
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limits derived from WMAP is given in Table 3. This shift is due to an improved estimate of the galaxy 
redshift distribution from CFHTLS-Deep (Ilbert et al. 2006), compared to the distribution obtained from 
Hubble Deep Field photometric redshifts, a small region of sky whose redshift distribution was dominated by 
systematic errors. Their measured signal is also in agreement with results from the 100 Square Degree Survey 
(Benjamin et al. 2007), a compilation of data from the earlier CFHTLS-Wide, RCS and VIRMOS-DESCAT 
surveys, together with the more recent GABoDS survey (Hetterscheidt et al. 2007) with sky coverage 113 
deg 2 and average source redshift z ~ 0.8. They find a8(D m /0.24)°- 59 = 0.84 =b 0.07, also consistent with 
WMAP . Both these analyses rely on a two-dimensional measurement of the shear field, comparing the 2D 
correlation functions with theoretical predictions. Cosmic shear has also been successfully measured in 
three-dimensions by the HST COSMOS survey (Massey et al. 2007), using redshift information to providing 
a measure of the mass fluctuation cr 8 (D m /0.3)°* 44 = 0.8661 q'o 68’ and a significant improvement over 2D 
limits from the same sample. This measure is somewhat higher than the WMAP value, as shown in Table 
3, although not inconsistent. 

Weak lensing is also produced by the distortion of the CMB by the intervening mass distribution. This 
signal can be probed by measuring the correlation of the gravitationally lensed CMB with tracers of large 
scale structure. A study was done with the WMAP first-year data combined with a sample of LRGs from 
SDSS, but no signal was detected, consistent with theoretical expectation (Hirata et al. 2004). Two recent 
analyses have found the first evidence for the cross-correlated lensing signal (Smith et al. 2007; Hirata et al. 
2008). They use the three-year WMAP data correlated with NVSS radio sources (Smith et al. 2007), finding 
a 3.4cr detection of the correlation, and WMAP combined with data from NVSS and SDSS, using both LRGS 
and quasars (Hirata et al. 2008), finding a correlation with a 2.1a level of significance. The cross-correlation 
in both cases is consistent with the five-year WMAP ACDM model. 

4.2.7. Integrated Sachs- Wolfe Effect 

Correlation between large-scale CMB temperature fluctuations and large-scale structure is expected in 
the ACDM model due to the change in gravitational potential as a function of time, and so provides a test 
for dark energy (Boughn et al. 1998). Evidence of a correlation was found in the first-year WMAP data 
(e.g., Nolta et al. (2004)), and has been improved with recent large scale structure data. A recent analysis 
by Ho et al. (2008) combine 2MASS, SDSS LRGs, SDSS quasars and NVSS radio sources with the WMAP 
three-year data, finding a 3.7cr detection of ISW, consistent with a universe described by the ACDM model. 
They also find a low signal at high redshift z > 1, as expected. A similar study by Giannantonio et al. (2008) 
find a 4 a detection of the correlation. Other recent studies have concentrated on individual data sets, finding 
a correlation at the level expected with the SDSS DR4 galaxies (Cabre et al. 2006), at high redshift with 
SDSS qusars (Giannantonio et al. 2006), and with the NVSS radio galaxies (Pietrobon et al. 2006; McEwen 
et al. 2007). 


4.2.8. Ly-a Forest 

The Lya forest seen in quasar spectra probes the underlying matter distribution on small scales, and so 
provides a long lever arm that can potentially place powerful constraints on the neutrino mass and a running 
spectral index. However, because they indirectly probe the underlying mass fluctuations, the relationship 
between absorption line structure and mass fluctuations must be fully understood to be used in a cosmological 
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analysis. The power spectrum of the Lya forest has been used to constrain the shape and amplitude of the 
primordial power spectrum (Viel et al. 2004; McDonald et al. 2005; Seljak et al. 2005; Desjacques & Nusser 
2005), and recent results combine the three-year WMAP data with the power spectrum obtained from 
the LUQAS sample of VLT-UVES spectra (Viel et al. 2006) and SDSS QSO spectra (Seljak et al. 2006). 
Both groups found similar results, suggesting a higher value for erg than consistent with WMAP . However, 
measurements by Kim et al. (2007) of the probability distribution of the Lya flux have been compared to 
simulations with different cosmological parameters and thermal histories (Bolton et al. 2007). They imply 
that the temperature-density relation for the IGM is close to isothermal or inverted, which would result in 
a smaller amplitude for the power spectrum than previously inferred. This reduction in erg would bring the 
constraints more in line with the five-year WMAP value of erg = 0.796 ± 0.036. Given these uncertainties, 
we choose not to use the Lya forest data for the main results presented in Komatsu et al. (2008). However, 
constraints on the running of the spectral index are discussed, using data described in Seljak et al. (2006). 
With more data and further analyses, the Lya forest measurements, which probe small scales at intermediate 
redshifts, have the potential to be a powerful cosmological test. 


4-2.9. Big- Bang Nucleosynthesis 

WMAP measures the baryon abundance at decoupling, and the ACDM model constraint is = 

0.02273 ±0.00062, with a baryon to photon ratio of r;io(WMAP) = 6.225 ±0.170. Light element abundances 
also depend on the baryon abundance, but at an earlier epoch, in the first few minutes after the Big Bang. 
The consistency of these measurements is therefore a powerful test of the Big Bang model. We can infer 
the BBN-predicted values of 7710 that correspond to the observed abundances of deuterium, helium, and 
lithium, and compare them to the CMB. Steigman (2007) reviews the current status of BBN measurements. 
Deuterium measurements provide the strongest test, and give rjio(D) = 6.0 ±0.4 based on new measurements 
by O’Meara et al. (2006) of a metal-poor damped Lya system along the sight line of QSO SDSS 1558- 
003. This is consistent with the WMAP measurement. The 3 He abundance is more poorly constrained 
at 7710 ( 3 He) = 5. 61^4 from the measure of 7/3 = 1.1 ± 0.2 by Bania et al. (2002). The abundance of 
4 He corresponds to smaller baryon abundances than consistent with WMAP: 7710 ( 4 He) = 2.7tJ;g from a 
measure of Yp = 0.240 ± 0.006, incorporating data from Izotov & Thuan (2004); Olive & Skillman (2004); 
Gruenwald et al. (2002) by Steigman (2007). However, observations of five HII regions using new atomic 
data by Peimbert et al. (2007) predict 7710 ( 4 He) = 7.8 ± 2, and higher values are also favored by Fukugita & 
Kawasaki (2006), both more consistent with WMAP. 

Measurements of the neutral lithium abundance in low-metallicity stars indicate values two times smaller 
than the BBN prediction from the CMB. The abundance is often expressed as logarithmic abundance, 
[Li]p = 12 ± log 10 (Li/H), and measurements have found [Li]p ~ 2.2 — 2.25 (Charbonnel & Primas 2005), 
[Li] p ~ 2.37 ± 0.05 (Melendez & Ramirez 2004) and more recently [Li]p ~ 2.1 by Asplund et al. (2005). 
These measurements pose a potential problem for the standard picture, and could be a signature of new early 
universe physics (see e.g., Coc et al. (2004); Richard et al. (2005); Jedamzik (2004)). However, arguments 
have been made that the discrepancy could be due to systematics, destruction of lithium in an earlier 
generation of stars, or uncertainties in the stellar temparature scale (Fields et al. 2005; Steigman 2006; 
Asplund et al. 2005). A possible solution has recently been proposed using new observations of stars in 
the globular cluster NGC 6397 (Korn et al. 2006). They find evidence that as the stars age and cool, the 
proportion of lithium in their atmospheres drops, indicating that transport processes within the star lead to 
the destruction of lithium. By extrapolating backwards in time, they infer an initial lithium content of the 



30 - 


stars [Li] p = 2.54 ± 0.1. This is in good agreement with BBN predictions, giving 7710 (Li) = 5.4 ± 0.6. More 
observations will help determine whether this is the correct explanation for the observed discrepancy. 


4.2.10. Strong Lensing 

Because the strong lensing cross-section is exponentially sensitive to distance, the number of strongly 
lensed quasars has the potential to probe cosmology, as a dark energy dominated universe predicts a large 
number of gravitational lenses (Turner 1990; Fukugita et al. 1990). Until recently, attempts to use this 
method have been limited both by the lack of large lens searches with well- understood systematics, and 
by uncertainties in both the source and lens populations. However, for both optically and radio-selected 
strongly lensed surveys, the observational data continues to improve. The CLASS radio band survey now 
has a large statistical sample of radio lenses (Myers et al. 2003; Koopmans et al. 2003; York et al. 2005). 
Analyses of this sample yields estimates for Da — 0.72 — 0.78 (Mitchell et al. 2005; Chae 2007). Oguri 
et al. (2008) have recently analyzed the large statistical lens sample from the Sloan Digital Sky Quasar Lens 
Search (Oguri et al. 2006), a sample that avoids many of the limitations of previous optical surveys. For a 
w = -1, flat cosmology, they find Da = 0.741q i 5 (stat.)lo ^(syst.). These values are all consistent with our 
best fit cosmology. 

The abundance of giant arcs has the potential to probe both the mass distribution in clusters and the 
underlying cosmology. Recent surveys have detected large numbers of giant arcs (Gladders et al. 2003; Sand 
et al. 2005; Hennawi et al. 2008). A number of authors have argued that their number and size is problematic 
for ACDM (Li et al. 2006; Broadhurst & Barkana 2008). This discrepancy has led to interest in early dark 
energy models (Fedeli & Bartelmann 2007) that predict higher lensing cross-sections. However, because of 
the sensitivity to the mass distribution, there is controversy over the predictions of ACDM models. Detailed 
numerical simulations (Meneghetti et al. 2007; Hennawi et al. 2007; Hilbert et al. 2007a; Neto et al. 2007) 
find that the lens cross-sections are sensitive to the details of the mass distribution as well as to the effect 
of baryon physics (Wambsganss et al. 2007; Hilbert et al. 2007b): most of these effects tend to increase 
the predicted number of giant arcs. Detailed comparisons between numerical simulations and systematic 
observational studies will be essential for this technique to be a useful test of cosmological models. 


4.2.11. Galaxy clusters 

Because clusters are easily detected and probe the high mass end of the mass distribution, they probe 
the amplitude of density fluctuations and of large-scale structure. Cluster observations at optical wavelengths 
provided some of the first evidence for a low density universe with the current preferred cosmological param- 
eters (see e.g., the Fan et al. (1997) analysis which found as = 0.83 =b 0.15 and D m = 0.3 =b 0.1). Observers 
are now using a number of different techniques for identifying cluster samples: large optical samples, X-ray 
surveys, lensing surveys (see e.g., Wittman et al. (2006)) and Sunyaev-Zeldovich surveys. The challenge for 
all of these survey techniques is to accurately determine the selection function and the relationship between 
astronomical observables and mass. Over the past few years, there has been significant progress in improv- 
ing the astronomical observable mass/light relationship in both the optical (Lin et al. 2006; Sheldon et al. 
2007; Reyes et al. 2008; Rykoff et al. 2008) and the X-ray (Sheldon et al. 2001; Reiprich & Bohringer 2002; 
Kravtsov et al. 2006; Arnaud et al. 2007; Hoekstra 2007). There are now large new optical cluster samples 
(Bahcall et al. 2003; Hsieh et al. 2005; Miller et al. 2005; Koester et al. 2007) and new X-ray samples (Pierre 
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et al. 2006). 

While there has been tension between cr 8 estimates from different cluster techniques, most recent results 
are converging on values close to the WMAP best fit value of erg = 0.796 ± 0.036. Cosmological constraints 
from RCS (Gladders et al. 2007) finds f = 0.30lo.ii an d = 0.70io.i 7 * Rozo et al. (2007) argues for 
08 > 0*^6 (95% confidence level) from SDSS BCG samples. Mantz et al. (2007) find D m = 0.271 q os an d 
08 = 0.77^o 06 f° r a ^ a ^ m °del based on the Jenkins et al. (2001) mass function and the Reiprich & Bohringer 
(2002) mass- luminosity calibration. With a 30% higher zero-point, Rykoff et al. (2008) find that their data 
are best fit by a 8 = 0.85 and = 0.24. Berge et al. (2007) report cr 8 — 0.921qJo f° r = 0.24 from 
their joint CFHTLS/XMM-LSS analysis. Over the next few years, cosmologists anticipate a number of large 
SZ surveys that have the potential to make accurate measurements of a 8 . Planck (Bartlett et al. 2008) 
should detect a large and well defined cluster sample. Simulations and theoretical analysis suggest a tight 
correlation between SZ signal and cluster mass (da Silva et al. 2004; Motl et al. 2005; Nagai 2006; Reid & 
Spergel 2006; Afshordi 2007). While SZ surveys are beginning to test these relationships (Muchovej et al. 
2007), they will need to be empirically tested in large samples before the SZ measurements are competitive 
with other techniques. 


4-2.12. Galaxy peculiar velocities 

By measuring galaxy velocity fields, observations can probe the growth rate of structure. For nearby 
galaxies where accurate density measurements are possible, these galaxy velocity fields can be compared 
directly to observations of large-scale structure. For larger scale surveys, measurements of redshift space 
distortions can be used to infer the growth rate of structure. While none of these measurements yet has the 
precision of the cosmological tests used in the joint analyses, they do provide an independent consistency 
check (see Nesseris & Perivolaropoulos (2008) for a recent review). 

One approach is to use objects with well-determined distances to look for deviations from the Hubble 
flow. Measurements of Tully-Fisher distances to galaxies enable a direct comparison of these deviations from 
the Hubble flow to the local density field (Strauss & Willick 1995; Dekel 2000; Zaroubi et al. 2001). For 
example, the Park & Park (2006) analysis of the peculiar velocities in the SFI sample find <r 8 ^m 6 = 0.56io.ii- 
With the much large SFI++ sample (Springob et al. 2007), the precision of this test should improve. Nearby 
supernova also show deviations from the local flow and can be used as velocity field probes (Riess et al. 
1995; Haugbplle et al. 2007). Using the currently available data sets, Gordon et al. (2007) correlate peculiar 
velocities of nearby supernova and find cr 8 = 0.79 ± 0.22. 

With the development of deep large-scale structure surveys, cosmologists have now been able measure 
/?, the amplitude of redshift space distortions as a function of redshift. When combined with a measurement 
of the bias, 5, this yields a determination of the growth rate of structure / = d\nG/d\na = /3b , where G 
is the growth factor. For Einstein gravity theories, regardless of the nature of the dark energy, we expect 
/ ^ D 7 with 7 ~ 6/11 (see Polarski & Gannouji (2007) for a more accurate fitting function). Analysis of 
redshift space distortions in the 2dF galaxy redshift survey (Peacock et al. 2001; Verde et al. 2002; Hawkins 
et al. 2003) find (3 = 0.47 ± 0.08 at z ~ 0.1, consistent with the ACDM predictions for the best fit WMAP 
parameters. The Tegmark et al. (2006) analysis of the SDSS LRG sample finds 0 = 0.31 ±0.04 at z = 0.35. 
The Ross et al. (2007) analysis of the 2dF-SDSS LRG sample find 0 = 0.45 ± 0.05 at z = 0.55. The Guzzo 
et al. (2008) analysis of 10,000 galaxies in the VIMOS-VLT Deep Survey finds that 0 = 0.70 ±0.26 at z = 0.8 
and infer d\nG/d\na = 0.91 ±0.36, consistent with the more rapid growth due to matter domination at this 
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1.4 : — r < 0.58 (5yr) 
12 - — r < 1.10 (3yr) 


Fig. 13. — Two-dimensional marginalized constraints (68% and 95% confidence levels) on inflationary pa- 
rameters r, the tensor-to-scalar ratio, and n s , the spectral index of fluctuations, defined at k o = 0.002/Mpc. 
One-dimensional 95% upper limits on r are given in the legend. Left: The five-year WMAP data places 
stronger limits on r (shown in blue) than three-year data (grey). This excludes some inflationary models 
including A</> 4 monomial inflaton models with r ~ 0.27, n s ~ 0.95 for 60 e-folds of inflation. Right: For 
models with a possible running spectral index, r is now more tightly constrained due to measurements of 
the third acoustic peak. Note: the two-dimensional 95% limits correspond to A(21nL) ~ 6, so the curves 
intersect the r = 0 line at the ~ 2.5cr limits of the marginalized n s distribution. 


epoch expected in a ACDM model. The da Angela et al. (2008) analysis of a QSO sample finds (3 = 0.60 tg Jt 
at 2 = 1.4 and use the clustering length to infer the bias. Extrapolating back to 2 = 0, they find a matter 
density of f l m = 0.25^0 07 - 

Peculiar velocity measurements can also probe the shape of the galaxy power spectrum. The Watkins 
& Feldman (2007) analysis of large scale flows finds that the power spectrum space parameter T ~ = 

0.13to os? consistent with ACDM values. 


5. Extended cosmological models with WMAP 

The WMAP data place tight constraints on the simplest ACDM model parameters. In this section we 
describe to what extent WMAP data constrain extensions to the simple model, in terms of quantifying the 
primordial fluctuations and determining the composition of the universe beyond the standard components. 
Komatsu et al. (2008) present constraints for WMAP combined with other data, and offer a more detailed 
cosmological interpretation of the limits. 
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Fig. 14.— Two-dimensional marginalized limits for the spectral index, n s , defined at k 0 = 0.002/Mpc, and 
the running of the index dn s /d\nk (marked n run ). Models with no tensor contribution, and with a tensor 
contribution marginalized over, are shown. In both cases the models are consistent with a power-law spectral 
index, with dn s /d\nk = 0, as expected from the simplest inflationary models. 

5.1. Primordial perturbations 

5.1.1. Tensor fluctuations 

In the ACDM model, primordial scalar fluctuations are adiabatic and Gaussian, and can be described 
by a power law spectrum, 

/ k \ Us ~ 1 

A 2 n (k) (x (-) , (20) 

producing CMB angular power spectra consistent with the data. Limits can also be placed on the amplitude 
of tensor fluctuations, or gravitational waves, that could have been generated at very early times. They leave 
a distinctive large-scale signature in the polarized B-mode of the CMB (e.g., Basko & Polnarev (1980); Bond 
& Efstathiou (1984)), that provides a clean way to distinguish them from scalar fluctuations. However, we 
have not yet reached sensitivities to strongly constrain this signal with the polarization data from WMAP. 
Instead we use the tensor contribution to the temperature fluctuations at large scales to constrain the tensor- 
to-scalar ratio r. We define r = A^(A:o)/A^(/co), where A^ is the amplitude of primordial gravitational waves 
(see Komatsu et al. (2008)), and choose a pivot scale ko — 0.002/Mpc. 

The WMAP data now constrain r < 0.43 (95% CL). This is an improvement over the three-year limit 
of r < 0.65 (95% CL), and comes from the more accurate measurement of the second and third acoustic 
peaks. The dependence of the tensor amplitude on the spectral index is shown in Figure 13, showing the 
n s — r degeneracy (Spergel et al. 2007): a larger contribution from tensors at large scales can be offset by 
an increased spectral index, and an overall decrease in the amplitude of fluctuations, shown in Table 4. The 
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Parameter 

Tensors 

Running 

Tensors ± Running 

r 

dn s /d In k 
n s 
erg 

< 0.43 (95% CL) 
0.986 ± 0.022 

O 777+0.040 
1 1 -0.041 

-0.037 ±0.028 

1 031 + 0 054 
1 .Uol _ 0 055 

0.816 ±0.036 

< 0.58 (95% CL) 
-0.050 ± 0.034 
1 - 0871^73 
0.800 ± 0.041 


Table 4: Selection of cosmological parameter constraints for extensions to the ACDM model including tensors and/or 
a running spectral index. 


degeneracy is partially broken with a better measure of the TT spectrum. There is a significant improvement 
in the limit on models whose scalar fluctuations can vary with scale, with a power spectrum with a ‘running’ 
spectral index, 




( 21 ) 


The limit from WMAP is now r < 0.58 (95% CL), about half the three-year value r < 1.1 (Spergel et al. 
2007). 


What do these limits tell us about the early universe? For models that predict observable gravitational 
waves, it allows us to exclude more of the parameter space. The simplest inflationary models predict a nearly 
scale-invariant spectrum of gravitational waves (Grishchuk 1975; Starobinsky 1979). In a simple classical 
scenario where inflation is driven by the potiential V (0) of a slowly rolling scalar field, the predictions (Lyth 
k Riotto 1999) are 


r ~ 4 a/N (22) 

1 — n s - (a + 2)/2N (23) 

for V{(j>) oc </> Q , where N is the number of e-folds of inflation between the time when horizon scale modes left 
the horizon and the end of inflation. For N=60, the A 0 4 model with r ~ 0.27, n s ~ 0.95 is now excluded 

with more than 95% confidence. An ra 2 </> 2 model with r ~ 0.13, n s ~ 0.97 is still consistent with the data. 

Komatsu et al. (2008) discuss in some detail what these measurements, and constraints for combined data 
sets, imply for a large set of possible inflationary models and potentials. 

With r = 0 also fitting the data well, models that do not predict an observable level of gravitational 
waves, including multi-field inflationary models (Polarski k Starobinsky 1995; Garcia-Bellido k Wands 1996), 
D-brane inflation (Baumann k McAllister 2007), and ekpyrotic or cyclic scenarios (Khoury et al. 2001; Boyle 
et al. 2004), are not excluded if one fits for both tensors and scalars. 


5.1.2. Scale dependence of spectral index 

The running of the spectral index has been the subject of some debate in light of WMAP observations, 
with the three-year data giving limits of dn s /d\n k = — 0.055±0.030, showing some preference for decreasing 
power on small scales (Spergel et al. 2007). Combined with high-£ CBI and VS A CMB data, a negative 
running was preferred at ~ 2a. A running index is not predicted by the simplest inflationary models (see e.g., 
Kosowsky k Turner (1995)), and the detection of a scale dependence would have interesting consequences 
for early universe models. Deviations from a power law index, and their consequences, have been considered 
by a number of groups in light of three-year data, including Easther k Peiris (2006); Kinney et al. (2006); 
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Fig. 15. — Marginalized two-dimensional limits (68% and 95%) on the amplitude of possible CDM entropy 
(or isocurvature) fluctuations. The one-dimensional 95% upper limits are given in the legend. Left: Anti- 
correlated fluctuations are tightly constrained, placing limits on curvaton models. Right: Uncorrelated 
fluctuations, corresponding to axion models for dark matter, add less power to the CMB spectrum than the 
anti-correlated case, for a given a , so higher values of ao are allowed (than a:_i), by the data. In both cases 
the amplitude is correlated with the spectral index of curvature fluctuations n Sl which compensates for the 
large scale power added by the CDM entropy fluctuations. 


Shafieloo & Souradeep (2007); Verde h Peiris (2008), using various parameterizations. In this analysis, and 
in Komatsu et al. (2008) we consider only a running index parameterized as in Eqn 21. 

We show in Figure 14 that with a better determination of the third acoustic peak, coupled with improved 
beam determination, the five-year WMAP data do not significantly prefer a scale-dependent index. The limit 
on the running is dn s /d\nk = —0.037 ± 0.028 for models with no tensor contribution. The running is anti- 
correlated with the tensor amplitude (Spergel et al. 2003, 2007), so the positive prior on the tensor amplitude 
leads to a more negative running preferred, dn s /d\nk = —0.050 ± 0.034, when a tensor contribution is 
marginalized over. Both limits are well within 2 a of zero, showing no evidence of departure from a power 
law spectral index. 


5.1.3. Entropy perturbations 

The simplest classical single-field inflation models predict solely adiabatic fluctuations, but entropy (or 
isocurvature) fluctuations are also predicted in a wide range of scenarios, including axions (Seckel & Turner 
1985; Linde 1985), multi-field inflation (Polarski & Starobinsky 1994; Garcia-Bellido & Wands 1996; Linde 
& Mukhanov 1997), and decay of fields such as the curvaton (Lyth & Wands 2002; Moroi & Takahashi 2001; 
Bartolo & Liddle 2002; Lyth et al. 2003). They may be correlated with the adiabatic fluctuations to some 
degree, depending on the model. Most physical scenarios generate only CDM or baryon entropy fluctuations 
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Parameter 

p = - i 

o 

II 

a_i 

< 0.011 (95% CL) 


ot o 


< 0.16 (95% CL) 

n s 

0.983 zb 0.017 

0.987 ±0.022 


0 '77Q +0.039 
U. / /O_0.038 

0.777 ±0.038 


Table 5: Subset of cosmological parameter constraints for ACDM models with additional anti-correlated (/ 3 = — 1) 
or uncorrelated (/ 3 = 0) entropy fluctuations. 


(Bond & Efstathiou 1987; Peebles 1987), with perturbation 

5 c = — - ^1, (24) 

Pc 4p 7 

for CDM with density p c . Neutrino modes are also possible (Bucher et al. 2000). It has been known for 
some time that the CMB data cannot be fit by pure entropy fluctuations (Stompor et al. 1996; Langlois & 
Riazuelo 2000), but a contribution may be allowed. Several groups have placed limits on a variety of models 
using the WMAP one-year and three-year data (Peiris et al. 2003; Valiviita & Muhonen 2003; Bucher et al. 
2004; Beltran et al. 2004; Dunkley et al. 2005; Kurki-Suonio et al. 2005; Lewis 2006; Bean et al. 2006; Trotta 
2007; Keskitalo et al. 2007), finding no strong evidence for entropy fluctuations. Significant levels have been 
found to be consistent with the CMB data (Bucher et al. 2004; Moodley et al. 2004; Bean et al. 2006), but 
require correlated admixtures of CDM and neutrino isocurvature perturbations that are hard to motivate 
physically. 

Here, and in Komatsu et al. (2008), we quantify the relative contributions to the angular power spectrum 
following Beltran et al. (2004); Bean et al. (2006), with 

C e = ( 1 - a)C f + aCf 4- 2/V«(l - <*)C? , (25) 

summing the spectra from curvature fluctuations Cf', entropy fluctuations Cf with power spectrum 

A|(fc)5 3 (k - k') = (fc/27r) 3 («S c (k)5 c (k')), (26) 

and a cross-correlation spectrum C f with power spectrum (fc)5 3 (k — k') = (fc/27r) 3 (— 7£(k)<S c (k')). This 
follows the definition of the curvature perturbation 7Z in Komatsu et al. (2008), which gives large-scale 
temperature anisotropy A T/T = —7Z/5. An anti-correlated spectrum with f3 = — 1 gives a positive Cf on 
large scales. 

Limits are found for a_i, corresponding to anti-correlated models with f3 = — 1. This could correspond 
to a curvaton scenario in which inflation is driven by an inflaton field, but CDM perturbations are gener- 
ated by the decay of a distinct curvaton field (see e.g., Lyth &; Wands (2002)). In this case we make the 
assumption that the spectral index of the anti-uncorrelated entropy fluctuations is equal to the adiabatic 
spectral index. We do not find evidence for curvaton entropy pertubations, finding a limit from WMAP of 
a_i < 0.011 (95% CL), shown in Figure 15 and in Table 5. This is half the three-year limit, and places 
strong lower limits on the possible density of the curvaton at its decay, in this scenario, compared to the 
total energy density (Komatsu et al. 2008). IF the curvaton dominated at decay, the perturbations would 
be purely adiabatic. Where has the improvement come from? The pure entropy spectrum and the cross- 
correlated spectrum both add large scale power, so a similar degeneracy is seen with the spectral index, and 
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Fig. 16. — Evidence for a non-zero effective number of neutrino species, iVeff. Left: The marginalized 
probability distribution gives iV e ff > 2.3 (95% CL) from WMAP alone. The best-fit ACDM model with 
iV e ff = 0 is a poorer fit to the data than iVeff = 3, with A\ 2 = 8.2. Inferred 95% limits from big bang 
nucleosynthesis (BBN) observations are highlighted. Right: Joint two-dimensional distribution for iV e ff and 
the CDM density, fi c /i 2 , with five-year limits in blue, compared to three-year limits in grey. The degeneracy 
valley of constant z eq is shown dashed, indicating that the CMB is now sensitive to the effect of neutrino 
anisotropic stress, which breaks the degeneracy. 


erg , as in the case where the tensor amplitude is varied. The entropy spectrum is also out of phase with the 
adiabatic spectrum, so the improved TE measurements combine with the third peak TT spectrum to tighten 
the limits. 

We also place limits on ao> corresponding to an uncorrelated model with (3 = 0. In this case the entropy 
spectral index is set to be scale invariant. Komatsu et al. (2008) describe how this corresponds to entropy 
perturbations created by axions, which would constitute some part of the dark matter budget. The limit is 
oo < 0-15 (95% CL), ten times higher than the anti-correlated amplitude, but still preferring pure adiabatic 
fluctuations. Without the large-scale power contribution from the anti-correlated spectrum, a much larger 
amplitude is permitted, but with the same degeneracy with the spectral index and erg. This has implications 
for the maximum deviation from adiabaticity of axion dark matter and photons. Komatsu et al. (2008) 
provide a discussion of the theoretical implications of these limits, and those for combined data, for both 
models considered. 


5.2. Composition and geometry of the Universe 

5.2.1. Number of relativistic species 

Neutrinos are expected to play an important role in the dynamics of the early universe. For standard 
parameters, they contribute about 40% of the energy density of the universe during the radiation epoch 
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Fig. 17. — Comparison of the CMB angular power spectrum for the best-fit ACDM models with the standard 
iV e ff = 3.04 neutrino species (red), and with AT eff = 0 species (blue). The lower panel shows the fractional 
difference between the two spectra when iV e ff is increased from 0 to 3.04. The iV e ff — 0 model has a lower 
fl m h 2 in order to fit the third peak, and a lower spectral index, n s , compared to the N e fi = 3.04 model. 


and about 11% of the energy density of the universe at z ~ 1100 (very close to the energy density in 
baryons). Because neutrinos contribute to the expansion of the universe and stream relativistically out of 
density fluctuations, they produce a significant imprint on the growth rate of structure and on the structure 
of the microwave background fluctuations. The amplitude of these effects depend upon iV e ff, the number of 
effective neutrino species. By ‘effective neutrinos species’, we are counting any particle that is relativistic at 
z ~ 1000 — 3000, couples very weakly to the baryon-electron-photon fluid, and has very weak self-interactions. 
Because we know neutrinos exist, we associate ‘neutrinos’ with ‘light relativistic particle’, but note that in 
the strictest sense we limit only light relativistic species, as the cosmological constraints are sensitive to the 
existence of any light species produced during the big bang or any additional contribution to the energy 
density of the universe (e.g., primordial magnetic fields). 

Measurements of the width of the Z provide very tight limits on the number of neutrino species: N u — 
2.984 ± 0.008 (Particle Data Book), consistent with the 3 light neutrino species in the standard model. 
Because of non-thermal effects due to the partial heating of neutrinos during the e ± annihilations, and 
other small corrections, the effective number of species is 3.0395 (Dicus et al. 1982; Mangano et al. 2002). 
Most analyses of the number of neutrino species with three-year WMAP data (Spergel et al. 2007; Ichikawa 
et al. 2007; Mangano et al. 2007; Hamann et al. 2007; de Bernardis et al. 2007) relied on combining CMB 
measurements with probes of the growth rate of structure. Since one of the signatures of the number of 
neutrino species is a change in the growth rate of structure, there are degeneracies between the properties of 
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Parameter 

Limits 

AU 

> 2.3 (95% CL) 


< 1.3 eV (95% CL) 

Yp 

< 0.45 (95% CL) 


Table 6: Constraints on neutrino properties and the primordial helium fraction. 


the neutrinos and of the dark energy. Neutrinos, however, leave a distinctive signature directly on the CMB 
power spectrum (see Bashinsky & Seljak (2004) for detailed discussion): the neutrinos not only suppress 
the CMB peak heights, they also shift the acoustic peak positions. While the effects that depend on shifts 
in the epoch of matter /radiation equality are degenerate with changes in the matter density, the effects of 
neutrino free-streaming are distinct. Changes in the baryon/matter ratio and the baryon/photon ratio also 
have their own imprints on the Silk damping scale and on the acoustic scale. With five years of data, we are 
now able to see evidence of the effects of the neutrinos on the CMB power spectrum. 

Figure 16 shows the limits on the number density of neutrinos and the density in dark matter. The 
degeneracy valley, shown in the right panel, corresponds to a constant ratio of matter density to radiation 
density, or equivalently a measurement of the expansion factor at matter radiation equality: 


1 4- z eq 


40500 


Pc + pb 
P'y T Pv 

n c h 2 + n b h 2 

1 4- 0.23iV eff ’ 


(27) 


With only 3 years of data and a lack of precision on the third peak position and height, WMAP was not able 
to make a clear detection of neutrinos (or relativistic species); however, the data did provide a ~ 2a hint 
of the effects of neutrino anisotropic stresses (Melchiorri & Serra 2006). Figure 16 shows that the five year 
data alone, we now constrain the number density of relativistic species: N e fi > 2.3 (95% CL). By bounding 
7V e ff < 10, and choosing a uniform prior on iV e ff, this level of significance depends somewhat on the prior. 
We therefore test the significance of the constraint by comparing two ACDM models: one with N e ^ — 0, 
and one with the standard N e fi = 3.04. We find that the data prefer N e ft — 3.04. The best-fit model has 
A(— 21nL) = 8.2 less than the N e fi = 0 best-fit model, corresponding to evidence for relativistic species 
at > 99.5% confidence. The CMB power spectra corresponding to these two models, and their fractional 
difference, are shown in Figure 17. The model with no neutrinos has a lower matter density, in order 

to keep z eq fixed. The improvement in likelihood between the two models comes from both the \ow-£ and 
high-f TT spectrum, with a small contribution from the TE spectrum. We also check that this evidence does 
not go away if we relax the assumption of a power-law spectral index, by testing a model with a variable 
running, dn s /d\nk ^ 0. 

Komatsu et al. (2008) combine WMAP data with other distance indicators (which constrain Q c h 2 ) and 
finds a stronger limit on the number density of neutrino species: N e ^ = 4.4 ± 1.5. These limits will continue 
to improve as CMB measurements of the higher peaks improve. The CMB constraints on the number of 
relativistic species at redshift ~ 1000-3000 complement constraints from big bang nucleosynthesis and from 
particle accelerators. Measurements of the abundance of Helium are sensitive to the expansion rate of the 
universe during its first few minutes (Steigman et al. 1977). The agreement between the best fit value from 
big bang nucleosynthesis, 7V eff = 3.24 ± 1.2 (95% confidence interval) (Cyburt et al. (2005), Particle Data 
Book 2007), with the best fit CMB value is another consistency check for standard cosmology. 
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Fig. 18. — Limits on the sum of neutrino masses with the WMAP five-year data. Left: The marginalized 
one-dimensional limit from WMAP alone is ^ m u < 1.3 eV (95% CL). This is raised by <10% with 
marginalization over a running spectral index, tensor fluctuations, or a dark energy equation of state w. 
Right: The neutrino mass is anti-correlated with cr 8 , the amplitude of matter fluctuations. 


5.2.2. Neutrino mass 

Cosmological data places limits on the mass of neutrinos. Atmospheric and solar neutrino experiments 
show that neutrinos are massive (see Mohapatra et al. (2005)), and measure the difference between the square 
of their masses, m 2 ui — m|C. Cosmological measurements constrain the sum of the masses due to their 

effect on the propagation of perturbations, on the clustering of matter, and on the expansion rate of the 
universe (Bond & Szalay 1983; Ma 1996; Hu et al. 1998). The mass has a large effect on the matter power 
spectrum, as massive neutrinos do not cluster as well as cold dark matter, leading to a suppression in power 
on small scales. Neutrinos also affect the CMB at earlier times: if the fraction of dark matter that is warm 
is raised, acoustic oscillations in the photon-baryon plasma are less strongly damped for modes that entered 
the horizon while the neutrinos were relativistic, raising the acoustic peak amplitudes. The radiation-like 
behavior at early times also changes the expansion rate, shifting the peak positions. 

These effects are somewhat degenerate with other parameters, so CMB data alone cannot limit the 
mass as well as when combined with other data. With the three- year WMAP data alone the limits were 
< 1.8 eV (Spergel et al. 2007), and < 0.66 eV when combined with other data. Since the three-year 
WMAP analysis there have been many studies of the constraints, as discussed in Komatsu et al. (2008). 

The five-year WMAP data now gives an upper limit on the total mass to be < 1.3 eV (95% CL), 

shown in Table 6. We have checked that this upper limit is robust to the choice of cosmological models. 
The upper limit is raised by < 10% when we include tensor fluctuations, a running spectral index, or a 
constant w / — 1 equation of state of dark energy, as shown in Figure 18. This dependence on additional 
parameters is consistent with earlier investigations by e.g., Crotty et al. (2004); Zunckel & Ferreira (2007). 
A larger neutrino mass raises the amplitude of the higher acoustic peaks, hence the observed degeneracy 
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with <t 8 (Figure 18). Stronger constraints come from combining the CMB data with probes of the expansion 
rate and clustering of matter at later times: Komatsu et al. (2008) find Yh m v < 0.61 (95% CL) for WMAP 
combined with additional data. 


5.2.3. Primordial Helium Abundance 

In most cosmological analyses the primordial helium abundance is fixed to be Y P = 0.24, motivated by 
observations discussed in Sec 4.2.9. The effect of the abundance on the CMB spectrum is small, but provides 
an independent cross-check of the BBN results, and probes for any difference between the helium abundance 
during the first few minutes, and after 300,000 years. The abundance affects the CMB at small scales due 
to the recombination process. The number density of electrons before recombination depends on the helium 
fraction through n e = n&( 1 — Yp) where is the baryon number density. Changing the electron number 
density changes the mean free path of Compton scattering, which affects the Silk damping scale. A larger 
Y p increasingly damps the power on small scales, as shown in Trotta & Hansen (2004). 

Constraints from the first-year WMAP data were presented in Trotta & Hansen (2004); Huey et al. 
(2004); Ichikawa & Takahashi (2006), with 99% upper limits of Y P < 0.65 inferred (Trotta & Hansen 2004). 
A subsequent analysis of the three-year data gave Y P < 0.61 at 95% confidence, tightened to 0.25 ±0.10 with 
small-scale CMB data (Ichikawa et al. 2007). We now find Y P < 0.45 (95% CL) with the five-year WMAP 
data. Higher values allowed by the thre-year data are disfavored with a better measure of the third acoustic 
peak height. With future small-scale CMB measurements constraints should significantly improve (Ichikawa 
et al. 2007). 


5.2.4 - Curvature of the universe 

In combination with other data, WMAP observations place strong constraints on the geometry of 
the universe (Spergel et al. 2007). The CMB measures with high accuracy the angular scale at which 
acoustic oscillations are imprinted at the last scattering surface, 6* = 0.5952 ± 0.0017 °. However, this 
alone does not provide a good measure of the geometry, as there is a degeneracy with the expansion rate 
of the universe since last scattering. This is shown in Figure 19, indicating the degeneracy between the 
dark energy density and the curvature 12^. With WMAP alone the curvature is poorly constrained to 
flk — —0.099lo.ioo’ < 0-76 (95% CL), assuming a Hubble prior of 20 < Ho < 100 and Qa > 0. 

The same degeneracy is seen, although slightly broadened, when the dark energy equation of state is allowed 
to vary. However, in both cases the Hubble constant decreases with increasingly negative curvature, taking 
values inconsistent with observation. This degeneracy can be used to constrain the curvature by combining 
observations (Jungman et al. 1996). In the three-year WMAP analysis, Spergel et al. (2007) showed that 
the degeneracy is truncated with the addition of only one piece of additional cosmological data (Type la 
supernovae, or the HST measurement of the Hubble constant, or galaxy power spectra), tightly constraining 
any deviations from flatness. Komatsu et al. (2008) draw similar conclusions with currently available data, 
and discuss the current limits on the spatial curvature from recent observations. 
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Q a 

Fig. 19. — Left: The points show the set of non-flat models consistent with the WMAP data, colored by the 
Hubble constant values. WMAP measures the acoustic peak scale to high accuracy, but does not constrain 
the curvature, ffy, by itself. However, the highly curved models have a low Hubble constant, inconsistent 
with observation. Right: Constraints on the dark energy equation of state, re, and the dark energy density, 
from WMAP alone. With a Hubble constant H 0 < 100, weak limits can be placed on re in a flat universe, 
shown by the blue contours, but the dark energy density and equation of state are unconstrained (with the 
95% confidence level shaded grey) if the assumption of flatness is relaxed. Limits are significantly improved 
when WMAP is combined with additional data (Komatsu et al. 2008). 

5.2.5. Dark energy properties 

The ACDM model requires a non-zero dark energy density = 0.742 ± 0.030 to fit the data, which is 
assumed to be in the form of a cosmological constant. We do not have an explanation for this component 
of the universe. A natural explanation could be a vacuum energy density (Carroll et al. 1992), but if so, we 
are faced with the fine-tuning problem to explain its observed value, 120 orders of magnitude smaller than 
expected from field theory arguments. Alternative explanations include quintessence (Peebles &; Ratra 1988; 
Wetterich 1988; Ferreira & Joyce 1998) or modifications to gravity (Deffayet et al. 2002). Testing the dark 
energy equation of state today, and as a function of cosmic time will help identify the possible explanation. 

The CMB by itself cannot place strong limits on the equation of state w — p/p, but by measuring the 
acoustic peak positions and heights, and constraining D rn h 2 with the third peak, limits the range of models 
to a degeneracy between and w, shown in Figure 19. The dark energy in these models is allowed to 
cluster. With a prior on the Hubble constant Hq < 100, WMAP alone places weak limits w = -l.06lo.42’ 
with = 0.731 q ii- ^ flatness is not assumed, the WMAP data cannot constrain w or due to the 
geometric degeneracy, also shown in Figure 19. However, the situation is significantly improved when WMAP 
is combined with astronomical data measuring the expansion rate and clustering of matter at late times. 
Komatsu et al. (2008) discuss limits obtained from various data in combination with WMAP, and find w 
constrained to be —1 to within 6% for a flat universe and constant equation of state. 
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6. Conclusions 

The simple six parameter ACDM model continues to fit the WMAP data. With five years of observations, 
we have better measured both the temperature and polarization anisotropy of the CMB. This has allowed 
us to measure with smaller errors, compared to the three-year analysis, the third acoustic peak in the 
temperature spectrum, and the low-^ polarization signal, leading to improved constraints on the cosmological 
parameters describing the contents of the universe, and the primordial fluctuations that seeded structure. 
The observations continue to be well fit by the predictions of the simplest inflationary models, with a scale- 
invariant spectrum of fluctuations disfavored. Consistency with the TE cross-correlation spectrum, now 
measured with better accuracy, provides additional confidence in this simple model. 

We have detected the optical depth to reionization with high significance. This measurement implies 
that reionization of the universe likely took place gradually, as it constrains a sudden reionization to be earlier 
than consistent with other observations. With more data, it will become possible to use the polarization data 
to better quantify the ionization history. Given the improvement in this measurement, and with a view to 
interpreting future large-scale polarization measurements, we develop an alternative way to remove Galactic 
foregrounds from low resolution polarization maps, which includes marginalization over uncertainties in the 
Galactic signal. We find consistent results using this method and the standard template-cleaning method. 

Considering a range of extended models, we continue to find that the standard ACDM model is con- 
sistently preferred by the data. The improved measurement of the third peak now requires the existence 
of light relativistic species, assumed to be neutrinos, at high confidence. The standard scenario has three 
neutrino species, but the three-year WMAP data could not rule out models with none. The ACDM model 
also continues to succeed in fitting a substantial array of other observations. Certain tensions between other 
observations and those of WMAP , such as the amplitude of matter fluctuations measured by weak lensing 
surveys and using the Ly-a forest, and the primordial lithium abundance, have either been resolved with 
improved understanding of systematics, or show promise of being explained by recent observations. With fur- 
ther WMAP observations we will better probe both the universe at a range of epochs, measuring fluctuation 
characteristics to probe the initial inflationary process, or other non- inflat ionary scenario, improving mea- 
surements of the composition of the universe at the recombination era, and characterizing the reionization 
process in the universe. 
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A. Tests of polarization cleaning using Gibbs sampling 

There are a number of prior assumptions made in the Gibbs-cleaning code used to estimate polarized 
CMB and Galactic foregrounds. Here we test their effect on the estimated CMB maps, using the stand-alone 
r code described in Page et al. (2007) as a diagnostic. The value of r is varied, and the exact likelihood 
computed for fixed TT first peak height, and fixed cosmological parameters. 

We do not place priors on the Q and U amplitude of synchrotron emission, but do limit the spectral 
behavior. In the fiducial model we assume a power-law spectral index, with spectral indices defined using the 
spectral index mask in Figure 3. Within each of the 30 regions, the index has a Gaussian prior — 3.0 ± 0.15. 
Changing the mask to have an alternative 30 regions where the sky is divided into equal-sized regions, or 
only 15, has almost no effect on the recovered r (see Table 7), and recovers spectral index maps with similar 
features. The North Polar Spur and Galactic anti-center consistently prefer spectral indices of ~ —3.1, given 
the prior. To use many more regions would require some modification to the current sampling method: 
the index parameters are sampled using Metropolis steps, so many more parameters would take a much 
longer time to converge. Alternative methods have been used in e.g. Eriksen et al. (2007) to sample index 
parameters. Since the Gaussian prior is rather tight, we test its effect by both moving the central value to 
-2.8, and widening the range to ±0.2. When it is broadened, the convergence is poor unless we also impose 
a hard prior of (3 > —2.5. In this case we find r = 0.096 ± 0.18, close to the fiducial value. To use broader 
priors it may be necessary to impose alternative priors on the indices, such as Jeffrey’s priors (Eriksen et al. 
2007). 

We make assumptions about both the spectral index and amplitude of the dust emission. The index 
is fixed at (3d = 1.7 in the fiducial model. Changing it to (3 = 2 has no effect on the recovered r value, as 
shown in Table 7. We also impose priors on the amplitudes of the dust Q and U Stokes vectors. In the 
fiducial case the amplitude of Q and U in each pixel, defined at W band, has a Gaussian prior 0 ± 0.2 /^(n), 
where Id is the intensity of dust from the FDS map, as well as a hard bound on the polarization amplitude, 
P(n ) < 0.2 Id(n). If the upper limit is raised to 50% in both cases, much higher than expected theoretically, 
there is a stronger degeneracy between the CMB and dust components, and the error on r is raised and 
the central value lowered, with r = 0.090 ± 0.021. This effect increases if we set the limit at 100%, or fully 
polarized, in which case r = 0.085 ±0.025. We also checked with the three-year data that varying the central 
value of the prior to Q — 0.05 cos(27 k)/^, U = 0.05 sin(27K)/<2, where 7 k are the K-band polarization angles, 


Description 

T 

Fiducial 

0.101 ±0.017 

Index mask 2 (30 regions) 

0.098 ±0.017 

Index mask 3 (15 regions) 

0.100 ±0.017 

ft = -2.8 ±0.15 

0.103 ±0.018 

ft = -3.0 ±0.2 

0.096 ±0.018 

ft = 2 

0.100 ±0.017 

Pd < 0.5 I d 

0.090 ±0.021 

Pd < Id 

0.085 ± 0.025 

KKaQVW 

0.093 ±0.016 


Table 7: Limits on the optical depth, r, for a variety of assumptions made for Gibbs sampling the low 
resolution polarization maps. 
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Fig. 20. — The angular power spectra of signal and noise components in the smoothed ILC map used for 
Gibbs sampling the low-^ temperature spectrum. Uncorrelated noise, at 2 fiK per pixel (red), is added to 
the smoothed ILC map to speed up the sampling, and then is assumed to be the only noise present. This 
assumption is inaccurate at low £, as it ignores the true noise (blue), but the error is negligible since it is 
significantly lower than cosmic variance (green). 


had a negligible effect on the results. In the absence of a good prior for the polarization amplitudes (e.g. 
from higher frequency observations), this highlights the importance of the dust intensity map as a means of 
limiting the polarized dust contribution. 

Finally, we use the KKaQV data combination for our fiducial result, but find similar limits on the optical 
depth when W band is included, with r = 0.093 ± 0.016. However, we choose not to use W band in the 
standard analysis, due to concerns about systematic effects (Hinshaw et al. 2008). 


B. Low-£ TT likelihood cross checks 

We have switched from the pixel likelihood to a Blackwell-Rao (BR) estimator at low ell, so perform 
several tests to verify that the new likelihood 1) is consistent with the pixel likelihood, 2) is insensitive to 
several choices of input data, and 3) is properly converged. Many of our checks involve comparing likelihoods. 
Because the likelihood is a function on a high dimensional space, we do not check it everywhere. Instead 


resolution parameter 

4 

5 

smoothing FWHM 

9.1831° 

5.0° 

0-noise/pixel [/i K] 

1.0 

2.0 

£ m ax sampled 

32 

51 

^max conditioned 

48 

96 


Table 8: Parameters used for sampling the low-£ TT likelihood at two different resolutions. 
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Multipole moment l 


Fig. 21. — The level of variation in the low-ell TT Gibbs likelihood that arises from different input parameters. 
The spectrum estimated from the standard input (black) is compared to results obtained using a different 
white noise realization (blue), using a larger mask (KQ80,red), and degrading to N s \^ e = 16 (green) rather 
then 32. The likelihood does fluctuate with these changes, but has a negligible effect on cosmological 
parameters. The values shown at each multipole correspond to maximum likelihood values obtained by 
fixing the spectrum of other multipoles at fiducial values. The error bars show where the likelihood is at 
50% and 5% of its peak value. 

we choose some fiducial spectrum and compute slices through the likelihood. We compare these conditional 
likelihoods, typically in the form of maximum likelihood values and full- width-half-max error bars between 
different low-£ likelihood methods. We also compare ACDM cosmological parameters obtained using the 
different inputs. 

The first comparison is between the pixel-based code and the Gibbs code. Figure 4 showed a compar- 
ison of the Ci likelihoods, with no significant differences, even though they use different resolutions. The 
cosmological parameters are unchanged for the two codes. Next, the Gibbs sampling can use maps degraded 
to N s [de = 16 or 32 (or higher resolution). Table 8 lists details of runs at these two different resolutions, 
including the standard deviation of Gaussian white noise added to the smoothed maps. The power spectrum 
of the white noise for the iV side = 32 case, is compared to the input noise and cosmic variance in Figure 
20. Power spectra are sampled up to £ max (sampled), and the sampled CMB skies are constructed with the 
sampled power spectra and by conditioning on a constant fiducial spectrum out to a higher value of ^ max 
(conditioned). In Figure 21 the estimated spectra are compared, using the fiducial KQ85 mask. There are 
some small differences, but the spectra are consistent. 

We also show in Figure 21 that the spectra obtained are almost identical for two different realizations for 
the added uncorrelated white noise, and that using the larger KQ80 mask, compared to the fiducial KQ85, 
has a small effect on the spectra and cosmological parameters, consistent with noise. Finally we compare 
results for different ^ max used in the BR estimator. The standard set-up uses ^ max — 32; using the low-^ 
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likelihood up to £ = 51 has almost no effect on cosmological parameters. 


C. Parameter estimation details 


C.l. Sampling method 


In the primary pipeline, only one chain is run for each model, rather than the typical four or eight 
parallel chains used in most MCMC cosmological analyses. This is possible as our spectral convergence test 
only requires a single chain. The starting points are picked as good-fitting points from previous analyses 
(e.g., a good-fitting value from the three- year WMAP analysis), or previous test chains, which means that no 
initial burn-in need be removed. Starting afresh with the five-year data, one can always use a point lying in 
the WMAP three-year limits as a starting point. This would not be the case for entirely new distributions, 
or much improved data, in which case we run a short initial exploratory chain to find the high likelihood 
region. 

The covariance matrix from which trial steps are drawn, is chosen using a best guess for the covariance 
C of the distribution being sampled. For a multi- variate Gaussian with covariance C, the optimal trial 
covariance matrix is Ct = (<7t/cto) 2 C, where 


&T 2.4 
c To \[D 


(Cl) 


for D-dimensional distributions (Hanson & Cunningham 1998; Dunkley et al. 2005). This relation also holds 
for somewhat non- Gaussian distributions. At each step we draw a vector G of D Gaussian unit variance 
zero mean random variates, and compute a trial set of parameters Xt, starting from the current position x$, 
where 

xt = Xi -h VC tG. (C2) 

Using an appropriate covariance matrix can speed up the sampling time by a factor of hundreds. In practice 
we have a good idea of the covariance from previous cosmological analyses, so this is always used as a starting 
point. With significantly different data, or with new parameters, a best guess is made. The chains are then 
run for a few thousand steps, and then updated if the matrix is inadequate. This is determined by the 
acceptance rate of the chain, which should be ~ 15 — 25% and by the chain efficiency, using the spectral 
test described in the next section. A second or third update may be required for models such as those with 
curved geometries with variable dark energy equation of state. Once a good covariance matrix is found, the 
chains are run for typically 20,000 steps, and then tested for convergence. Convergence for ACDM chains 
typically takes place after only ~6000 iterations, although at this point the ID and 2D distributions are 
noisy. In practice we then run all the chains for longer than the convergence limit, typically for 100,000 
iterations, in order to get well sampled histograms. The chains are not thinned before analysis. 


C.2. Spectral convergence test 

A chain has ‘converged’ when its statistical properties reflect those of the underlying distribution with 
sufficient accuracy. To determine this stopping point, we use the spectral convergence test described in 
Dunkley et al. (2005) in our main parameter pipeline. The power spectrum of each parameter of the chain 
is used as a diagnostic, to check whether the chain has (1) sampled the distribution in such a way that it 
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is unbiased by correlations, and (2) sampled enough points that statistics can be estimated with sufficient 
accuracy. The Gelman & Rubin test (Gelman & Rubin 1992), commonly used in cosmology, can sometimes 
fail to test the first point, producing a false positive. 

To estimate the power spectrum P(k) from a chain of length N we construct Pj = | a, ^ | 2 , where j = 
27 rk/N, by taking the discrete Fourier Transform of the chain of values for each parameter, x, 


N - 1 


a ] = -J= y] x(n) exp[i2ir(jn/W)] 

V 71 = 0 


(C3) 


where —(N/2 — 1) < j < N/2. x(n) is the value at each iteration n, so chains stored in weighted format 
are converted to unweighted arrays for analysis. Since the Metropolis algorithm produces chains which are 
correlated on small scales, the power spectrum tends to a white noise spectrum on large scales, and turns over 
to a spectrum with suppressed power at large k , with the turnover position reflecting the inverse correlation 
length. In Dunkley et al. (2005) it is shown that the spectrum can be fit by the following template: 


P(k) = Po 


(k*/k) a 
(k* /k) a + 1 ’ 


(C4) 


with Po giving the amplitude of the white noise spectrum in the k — ► 0 limit, k* indicates the position 
of the turnover to a different power law behavior, characterized by a, at large k. This model is shown in 
Figure 3 of Dunkley et al. (2005), fitting the noisy spectrum of a parameter from a chain. The model fits 
the noise- averaged spectrum of a real chain obtained from Monte-Carlo simulations, also shown in Dunkley 
et al. (2005). 

To fit the parameters InPo, k* and a to Pj using least squares for a finite chain, we have 


In Pj = ln[Po] 4- In 


(Nk*/2nj) a 
1 + (Nk* /27rj) a 


-7 + r j, 


(C5) 


where 7 = 0.577216 is the Euler-Mascheroni constant, and r 3 are random measurement errors. The parame- 
ters are fit over the range of Fourier modes 1 < j < 10j*, for a spectrum with j* = k*(N/2ir ), so we iterate 
twice to converge on the j* limit. 

For convergence, the largest scales probed must be in the white noise regime P(k) ~ A: 0 , defined by 
the requirement j* > 20 for each parameter. This insures that the correlated points are not biasing the 
distribution and indicates that the chain is drawing points throughout the full region of high probability. To 
test sufficient accuracy, we require the convergence ratio r = ct^/o-q to be less than 1% for each parameter, 
where is the variance in the sample mean, and ctq is the variance of the sampled parameter. The Gelman 
& Rubin test incorporates a similar ratio: their R statistic roughly translates to P ~ 1 + r, but the quantity 
is calculated using multiple parallel chains. It is shown in Dunkley et al. (2005) that r can be estimated 
using a single chain, since estimating the sample mean variance of a long chain, with zero mean, is equivalent 
to estimating P(k) at k = 0: 


= 7 2 ) ~ • p ( fc = °)- 


(C6) 


In practice we rescale each parameter to have zero mean and unit variance before computing its power 
spectrum. Then we estimate the value of Po for each parameter, to compute r — Po/N . We require r < 0.01, 
but in practice obtain much smaller values, typically with r < 0.001. 
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